posted Mar 29

Data Analyst

Apache AWS Azure Cloud GCP Numpy Pandas PySpark Python Spark SQL mid

Job Location: Remote

Job Description

• Analyzing data using statistical techniques and tools to identify anomalous data, clean data, and derive meaningful insights and trends. • Ensuring data integrity, accuracy, and completeness throughout the analysis process. • Generate, maintain, and update dashboard reports using our business intelligence tool, highlighting key findings and trends. • Develop and maintain databases, data systems, and data analytics pipelines within database management systems. • Work with stakeholders in Engineering, Product, and Revenue to assist with data-related technical issues and support their data infrastructure and analytics needs. • Ensure the integrity and consistency of database schemas, including managing updates, version control, and documenting schema changes to support data analysis and reporting requirements. • Responsible for assistance and further development of our quality assurance process.

Qualifications

• 3-5+ years industry experience with clear examples of strategic and analytical technical problem solving and implementation • Strong software development and analytics fundamentals • Expertise in with SQL & Python • Experience with Apache Spark or PySpark • Experience with data cleaning and data processing (e.g., cleaning, transformation) using SQL and Python. • Knowledge of modern data design and storage patterns (e.g., incremental updating, partitioning and segmentation, rebuilds and backfills) • Experience working in Databricks (including delta live tables, data lakehouse patterns, etc.) • Experience with cloud computing services (AWS (preferred), GCP, Azure or similar) • Experience with data warehousing (e.g., Databricks, Snowflake, Redshift, BigQuery, or similar) • Understanding of modern data storage formats and tools (e.g., parquet, ORC, Avro, Delta Lake) • Must thrive in a fast paced environment and be able to work independently • Can work effectively remotely (able to be proactive about managing blockers, proactive on reaching out and asking questions, and participating in team activities) • Strong written communication skills on Slack/Chat and in documents • You are experienced in writing data design docs (pipeline design, dataflow, schema design) • You can scope and breakdown projects, communicate and collaborate progress and blockers effectively with your manager, team, and stakeholders • Experience collaborating with Product and Engineering teams

Benefits

• Stock • Competitive Salaries • Unlimited paid time off • Medical, dental, & vision insurance • Health, fitness, and office stipends • The permanent ability to work wherever and however you want

logo
Company
CDC Foundation
Post Date
New
Title
Prospect Research Analyst
Type
$69,273 - $102,000 a year
Location
Remote
logo
Company
Jerry
Post Date
New
Title
Senior Marketing Analyst, Paid Search
Location
San Francisco, California
logo
Company
Twin Health
Post Date
New
Title
Lead Business Intelligence Developer (Tableau)
Type
$140,000 - $160,000 a year
Location
Remote
logo
Company
CDC Foundation
Post Date
New
Title
Data Analyst, Analytics & Visualization
Location
Remote
logo
Company
MongoDB
Post Date
New
Title
Staff Data Analyst, Marketing
Type
$112,000 - $220,000 a year
Location
Manhattan, New York