Software Engineer, Data
Company: HeyGen
Location: Los Angeles
Posted on: April 1, 2026
|
|
|
Job Description:
About HeyGen At HeyGen, our mission is to make visual
storytelling accessible to all. Over the last decade, visual
content has become the preferred method of information creation,
consumption, and retention. But the ability to create such content,
in particular videos, continues to be costly and challenging to
scale. Our ambition is to build technology that equips more people
with the power to reach, captivate, and inspire audiences. Learn
more at www.heygen.com . Visit our Mission and Culture doc here .
Position Summary A Software Engineer with data engineering
responsibilities to bridge the gap between core application
development and large-scale data infrastructure. You will help
build the data foundational layers for our next-generation
features. This role is not just about moving data—it’s about
enabling AI models to function in real-time, building robust
pipelines for multimedia, and powering engaging user experiences.
This team is currently working on cutting-edge features including
PPT-to-video converters and interactive, conversational video
capabilities. Core Responsibilities Build & Scale Data Pipelines:
Design, develop, and maintain robust batch and real-time data
pipelines (using Python, Go, Spark, Kafka) that ingest and
transform massive multi-modal data—text, audio, and video—to train
and run AI models. Power Intelligent Features: Collaborate with ML
engineers to implement data structures and APIs for new, exciting
features like PPT-to-video automation and interactive AI avatars
that require low-latency data fetching. Data Lakehouse
Infrastructure: Architect and manage data lakehouse solutions
(e.g., Snowflake, Databricks, Apache Iceberg) to store and query
unstructured media data efficiently, enhancing storage and
computation efficiency. Data Reliability & Observability: Implement
data quality checks, data contracts, and monitoring to ensure high
reliability of data, preventing downtime in production video
generation. Productize Data: Transform raw data into structured,
actionable data products that can be easily consumed by front-end
applications, API endpoints, and AI agents. Qualifications
Bachelor’s/Master’s degree in Computer Science, Engineering, or a
related field. 3-5 years of experience as a Backend Software
Engineer with heavy data processing responsibilities. Strong
proficiency in Python (for ETL/scripting) and SQL (for data
modeling). Experience with cloud platforms (AWS/GCP) and data
technologies like Kafka, Spark, and Snowflake/Databricks.
Experience or interest in Computer Vision/Generative AI data
processing. Proactive, "owner" mindset; ability to operate in a
fast-paced, startup environment. What HeyGen Offers Competitive
salary and benefits package. Dynamic and inclusive work environment
focused on innovation and creativity. Opportunities for
professional growth and skill development. Collaborative culture
that values teamwork and employee input. Access to state-of-the-art
technologies and tools. Salary Range $180,000 – $220,000 equity
benefits Please note that the salary information is a general
guideline only. HeyGen considers factors such as scope and
responsibilities of the position, candidate's work experience,
education/training, key skills, and internal equity, as well as
location, market and business considerations when extending an
offer. As part of our total rewards package, HeyGen offers
comprehensive benefits including equity, a 401k plan, health
benefits, generous PTO, a parental leave program and emotional
health resources. HeyGen is an Equal Opportunity Employer. We
celebrate diversity and are committed to creating an inclusive
environment for all employees. Join us at HeyGen and be part of a
team that's reshaping the world of video creation through
innovative technology!
Keywords: HeyGen, Simi Valley , Software Engineer, Data, Engineering , Los Angeles, California