Job Description: Data Engineer (Modern Data Stack / Greenfield)

HData is a leading regulatory technology company that specializes in delivering streamlined compliance and business intelligence solutions to the U.S. energy industry. Our innovative platform empowers organizations to navigate complex regulatory landscapes, optimize operations, and make data-driven decisions. As we continue to enhance our modern data stack, we are seeking a talented Data Engineer to join our team. The successful candidate will play a key role in implementing workflow orchestration, developing custom connectors for data ingestion, and leveraging dbt and Snowflake to ensure the integrity and efficiency of our data pipelines.

LOCATION:

Remote

ROLE OVERVIEW:

As a Data Engineer at HData, you will be responsible for contributing to the development and optimization of our data infrastructure. Your primary focus will be on implementing workflow orchestration, designing and building custom connectors for data ingestion, and leveraging dbt and Snowflake to enhance data reliability and scalability. 

YOU’LL THRIVE HERE IF YOU:

  • You conduct yourself with honesty, integrity, and respect in all your interactions, aligning with HData’s Core Value

  • You can communicate effectively and operate in a fast-paced, dynamic environment

  • You can build partnerships that move our business forward

  • You build code that is simple, understandable, and clean

  • You see feedback or failure as motivation to learn and grow

  • You believe data-driven decision-making is the norm

RESPONSIBILITIES:

  • Design, build, and maintain scalable and reliable data pipelines using Snowflake, dbt, and Dagster.
  • Implement and maintain dbt models to transform raw data into insightful, analytical-ready datasets.
  • Utilize web scraping techniques to extract data from online sources and integrate it into our data ecosystem.
  • Implement OCR-based data extraction to extract structured data from scanned documents and images.
  • Collaborate closely with cross-functional teams to gather requirements, understand data needs, and translate them into technical solutions.
  • Monitor and troubleshoot data pipelines, proactively identifying and resolving issues related to data ingestion, transformation, and loading.
  • Conduct data validation and testing to ensure the accuracy, consistency, and compliance of data.
  • Stay up-to-date with emerging technologies and best practices in data engineering.
  • Document data workflows, processes, and technical specifications to facilitate knowledge sharing and ensure data governance.

QUALIFICATIONS:

  • Bachelor's degree in Computer Science, Engineering, or a related field. Equivalent work experience will also be considered.
  • 3+ years experience in data engineering, ELT pipeline development, and data modeling.
  • Strong proficiency in SQL and experience with Snowflake data warehousing.
  • Hands-on experience with dbt (data build tool) for data transformation.
  • Experience with workflow orchestration tools, preferably Dagster.
  • Strong understanding of SQL and database concepts, with the ability to write efficient queries and optimize performance.
  • Strong programming skills, particularly in Python, with experience in web scraping.
  • Knowledge of cloud-based data platforms (e.g., AWS, Azure, GCP) and their associated data services.
  • Excellent problem-solving and troubleshooting skills, with a strong attention to detail.
  • Effective communication and collaboration abilities, with a proven track record of working in cross-functional teams.

PREFERRED QUALIFICATIONS:

  • Previous experience in the regulatory compliance or energy industry.
  • Understanding of NLP techniques to analyze text data and derive valuable insights for compliance and business intelligence purposes.
  • Familiarity with NLP techniques and libraries for text data analysis.
  • Familiarity with data governance frameworks and practices.
  • Strong understanding of software development principles and practices, including version control (e.g., Git) and code review processes.
  • Experience with Agile development methodologies and working in cross-functional Agile teams.
  • Ability to adapt quickly to changing priorities and work effectively in a fast-paced environment.
  • Excellent analytical and problem-solving skills, with a keen attention to detail.
  • Strong written and verbal communication skills, with the ability to effectively communicate complex technical concepts to both technical and non-technical stakeholders.

BENEFITS:

  • Medical, Dental & Vision Benefits 

  • Performance Bonus

  • 401k Retirement Plan

  • 401k Matching

  • Equity Benefit Package

  • Remote, Hybrid, & In-Office Friendly

  • Flexible PTO

  • Relocation Assistance

  • Life Insurance

  • Assortments of Discounts, Perks, and Deals

HData is committed to promoting equality, inclusion, and diversity. We’re an equal-opportunity employer of the brightest minds we can find — regardless of race, gender, age, religion, sexual orientation, or identity.

Join HData and be part of our mission to revolutionize regulatory compliance and empower the U.S. energy industry with streamlined data operations and business intelligence to help us drive innovation and deliver impactful solutions in a dynamic and challenging industry. If you’re interested in joining us, send a note to hiring@hdata.us!