Is the Mid-Level Data Engineer position remote, hybrid, or on-site?

The Mid-Level Data Engineer role at Simple Technology Solutions is a remote opportunity. The location specified by the employer is Remote Worldwide.

What is the salary range for the Mid-Level Data Engineer role at Simple Technology Solutions?

The salary for Mid-Level Data Engineer at Simple Technology Solutions is not explicitly stated, but is competitive and based on experience.

How do I apply for the Mid-Level Data Engineer position?

You can apply directly by visiting the dynamic application link on FutureTalent at: https://www.futuretalent.online/jobs/9277-mid-level-data-engineer-simple-technology-solutions.

Mid-Level Data Engineer Job at Simple Technology Solutions | Remote Opportunity

✨ AI Insights & Summary

Join Simple Technology Solutions (STS), a "Best Place to Work" recognized for its commitment to employee well-being and professional growth, especially for those in HUBZones. This Mid-Level Data Engineer role is a critical opportunity to build and maintain enterprise-scale ETL pipelines on AWS for federal government clients. If you're passionate about data, thrive in an agile cloud environment, and possess strong PySpark and SQL skills, this position offers a chance to contribute to vital government functions while continuously learning and developing your expertise.

About Simple Technology Solutions

At Simple Technology Solutions, our people are our priority. We foster a culture of collaboration, continuous learning, and excellence, recognizing that our team members are multifaceted individuals. As a HUBZone company, we offer special incentives for team members living in qualified HUBZones. We are committed to equal employment opportunity and merit-based employment practices.

The Role

STS is seeking a Mid-Level Data Engineer to join a federal data engineering team. You will work alongside senior engineers building and maintaining ETL pipelines on a cloud-based Enterprise Data Platform (EDP) built on AWS. This role involves processing terabytes of financial data across a large portfolio of automated pipelines within an agile team supporting critical government functions. A willingness to learn, strong attention to detail, and a team-first mindset are essential.

Note: This position is contingent upon contract award.

Responsibilities

Develop new ETL pipelines and data ingestion processes using AWS Glue (Spark-based, PySpark), MWAA (Airflow), Lambda, and SNS, adhering to agency standards.
Integrate the agency's ETL Common Library into Glue jobs for standardized orchestration, error handling, and notifications.
Ingest structured and semi-structured datasets into S3 data lake zones using Apache Iceberg tables with Parquet format, ensuring transactional loading.
Configure static ETL metadata in a centralized PostgreSQL metadata store and ensure dynamic metadata records job status.
Monitor production jobs, participate in operations support rotations, and promptly identify/escalate issues.
Ensure ETL Load Reports are populated in real-time and ETL Gap Reports are updated weekly.
Build and maintain materialized views and semantic layer objects in Trino and Athena for optimized query performance.
Produce and maintain required documentation: Business Requirements, ETL Design Documents, Data Models (Mermaid format), Data Dictionaries, Mapping Documents, Deployment Documents, O&M Guides, and ETL Test Plans.
Write unit and integration tests to achieve a 90% minimum code coverage threshold and complete security scans.
Deploy ETL resources using CloudFormation templates through the agency CI/CD pipeline and submit Change Requests.
Support the transition of ETL jobs from other agency teams by verifying standards, performing deployments, and validating data loads.
Support disaster recovery exercises, pre-production deployments, and ad hoc data requests.
Participate in agile sprint ceremonies, PI planning, and backlog refinement using JIRA and GitHub.

Education and Experience

Required:

Bachelor's degree or higher in Computer Science, Information Systems, Data Engineering, or a related field.
3-5 years of experience in data engineering or a closely related technical role.
Hands-on experience with Python (PEP 8), PySpark, and SQL for ETL pipeline development.
Experience with AWS services including Glue, S3, MWAA (Airflow), Lambda, SNS, and SQS.
Familiarity with Apache Iceberg, Parquet, and ORC file formats and S3 data lake zone concepts.
Experience with PostgreSQL and basic familiarity with Redshift or Oracle.
Familiarity with Trino or Athena for query and semantic layer development.
Experience with CloudFormation, GitHub branching workflows, and CI/CD-integrated deployments.
Ability to produce clear ETL documentation, including data models (Mermaid format) and data dictionaries.
Understanding of ETL metadata concepts.
Experience in agile development environments with sprint-based delivery.
Experience supporting IV&V and/or User Acceptance Testing (UAT) processes in a federal or technical program environment.
Experience with automated testing frameworks and achieving defined code coverage thresholds.
Familiarity with FISMA, NIST 800-53, and OWASP ASVS Level 2 is a plus.
Must be able to work 8 am - 5 pm Eastern Time.
Active federal public trust suitability determination or ability to obtain one is required.

Equal Employment Opportunity

STS is committed to equal employment opportunity and merit-based employment practices. We provide equal opportunities to all employees and applicants and prohibit discrimination and harassment.

Mid-Level Data Engineer

✨ AI Insights & Summary

About Simple Technology Solutions

The Role

Responsibilities

Education and Experience

Equal Employment Opportunity

Apply Now

Share Job

Job Overview

FAQ

Is this position remote?

What is the salary?

How do I apply?

Similar Opportunities

Senior Data Engineer

Data Engineer

Senior Data Engineer (PowerBI & Data Integration)