posted Mar 20

Senior Data Engineer

Airflow Apache AWS Azure Cloud GCP Java Kafka Numpy Pandas Python Scala Spark SQL senior

Job Location: Remote

Job Description

• Build infrastructure for ingestion, transformation, and loading an exponentially increasing volume of data from a variety of sources using Spark, SQL, AWS, and Databricks • Building an organic entity resolution framework capable of correctly merging hundreds of billions of individual entities into a number of clean, consumable datasets • Developing CI/CD pipelines and anomaly detection systems capable of continuously improving the quality of data we're pushing into production • Devising solutions to largely-undefined data engineering and data science problems • Work with stakeholders in Engineering and Product to assist with data-related technical issues and support their infrastructure needs

Qualifications

• 5-7+ years industry experience with clear examples of strategic technical problem solving and implementation • Strong software development fundamentals • Experience with Python Expertise with Apache Spark (Java, Scala, and/or Python-based) • Experience with SQL • Experience building scalable data processing systems (e.g., cleaning, transformation) from the ground up • Experience using developer-oriented data pipeline and workflow orchestration (e.g., Airflow (preferred), dbt, dagster or similar) • Knowledge of modern data design and storage patterns (e.g., incremental updating, partitioning and segmentation, rebuilds and backfills) • Experience working in Databricks (including delta live tables, data lakehouse patterns, etc.) • Experience with cloud computing services (AWS (preferred), GCP, Azure or similar) • Experience with data warehousing (e.g., Databricks, Snowflake, Redshift, BigQuery, or similar) • Understanding of modern data storage formats and tools (e.g., parquet, ORC, Avro, Delta Lake)

Benefits

• Stock • Competitive Salaries • Unlimited paid time off • Medical, dental, & vision insurance • Health, fitness, and office stipends • The permanent ability to work wherever and however you want

logo
Company
Terakeet
Post Date
New
Title
Sr. Data Scientist
Type
$107,000 - $162,000 a year
Location
Remote
logo
Company
Alphatec Spine
Post Date
New
Title
Senior Data Security Engineer
Type
$130,000 - $150,000 a year
Location
Unknown, California
logo
Company
Samsara
Post Date
New
Title
Marketing Data Engineer
Type
$95,200 - $160,000 a year
Location
Remote
logo
Company
Redwood Materials
Post Date
New
Title
Senior Data Engineer
Location
Remote
logo
Company
Amyris
Post Date
New
Title
Senior Data Engineer
Type
$110,000 - $130,000 a year
Location
Unknown, California