Lead Data Engineer Job at WorkHQ, Los Angeles, CA

  • WorkHQ
  • Los Angeles, CA

Job Description

Company Context

Series A, well-funded US startup in HRTech developing WorkHQ.com and an AI Recruiter product.

This is a remote role open to US-based candidates only (mainland US).

Role Overview

As the lead data infrastructure architect, you will manage billions of data points across 250M+ professional profiles.

You will also hire data engineers to support that work.

Core Responsibilities

  • Design scalable data pipelines processing massive record volumes

  • Architect ETL processes using PySpark on Amazon EMR (we are open to moving to alternatives such as Databricks or Snowflake)

  • Distribute enriched data through a medallion architecture across Postgres, Athena, and OpenSearch

  • Integrate new data sources into the main pipeline

  • Implement advanced data matching using Splink

Technical Requirements

  • 5-8 years of professional data engineering experience

  • Proficiency in:

    • PySpark and distributed computing

    • AWS data services (EMR, Glue, Athena)

    • Docker

    • Pandas and DataFrame manipulation

    • Complex data format handling (JSONL, Parquet)

  • Strong background in:

    • Big data processing architectures

    • Data warehouse design

    • Performance optimization

  • Advanced Python and SQL skills

Nice to Have

  • Probabilistic record linkage expertise

  • OpenSearch/Elasticsearch technologies

  • Machine learning data pipeline design

  • Recruitment tech ecosystem knowledge

Technical Stack

  • Big Data: PySpark, EMR

  • Databases: Postgres, OpenSearch

  • Cloud: AWS

  • Containerization: Docker

  • Data Formats: JSONL, Parquet

  • Analytics: Metabase, Athena, Glue

  • Data Processing: Pandas, Splink

Other Considerations

While this role has specific requirements, if you lack a few of the technical skills but are motivated to learn and to lead the platform, please apply for consideration.

If you are coming from a Director, Head of, or VP-level role that is relevant to this job, you are welcome to apply as well.

You will need to apply directly on our platform.

Thank you for your time.

Job Tags

Permanent employment, Shift work
