← Back to all jobs
24d 11h left to apply
S

Mid-Level Data Engineer

Simple Technology Solutionsā€¢šŸŒ Remote Worldwide•Estimated: $80,000 - $120,000

✨ AI Insights & Summary

Join Simple Technology Solutions (STS), a "Best Place to Work" recognized for its commitment to employee well-being and professional growth, especially for those in HUBZones. This Mid-Level Data Engineer role is a critical opportunity to build and maintain enterprise-scale ETL pipelines on AWS for federal government clients. If you're passionate about data, thrive in an agile cloud environment, and possess strong PySpark and SQL skills, this position offers a chance to contribute to vital government functions while continuously learning and developing your expertise.

About Simple Technology Solutions

At Simple Technology Solutions, our people are our priority. We foster a culture of collaboration, continuous learning, and excellence, recognizing that our team members are multifaceted individuals. As a HUBZone company, we offer special incentives for team members living in qualified HUBZones. We are committed to equal employment opportunity and merit-based employment practices.

The Role

STS is seeking a Mid-Level Data Engineer to join a federal data engineering team. You will work alongside senior engineers building and maintaining ETL pipelines on a cloud-based Enterprise Data Platform (EDP) built on AWS. This role involves processing terabytes of financial data across a large portfolio of automated pipelines within an agile team supporting critical government functions. A willingness to learn, strong attention to detail, and a team-first mindset are essential.

Note: This position is contingent upon contract award.

Responsibilities

  • Develop new ETL pipelines and data ingestion processes using AWS Glue (Spark-based, PySpark), MWAA (Airflow), Lambda, and SNS, adhering to agency standards.
  • Integrate the agency's ETL Common Library into Glue jobs for standardized orchestration, error handling, and notifications.
  • Ingest structured and semi-structured datasets into S3 data lake zones using Apache Iceberg tables with Parquet format, ensuring transactional loading.
  • Configure static ETL metadata in a centralized PostgreSQL metadata store and ensure dynamic metadata records job status.
  • Monitor production jobs, participate in operations support rotations, and promptly identify/escalate issues.
  • Ensure ETL Load Reports are populated in real-time and ETL Gap Reports are updated weekly.
  • Build and maintain materialized views and semantic layer objects in Trino and Athena for optimized query performance.
  • Produce and maintain required documentation: Business Requirements, ETL Design Documents, Data Models (Mermaid format), Data Dictionaries, Mapping Documents, Deployment Documents, O&M Guides, and ETL Test Plans.
  • Write unit and integration tests to achieve a 90% minimum code coverage threshold and complete security scans.
  • Deploy ETL resources using CloudFormation templates through the agency CI/CD pipeline and submit Change Requests.
  • Support the transition of ETL jobs from other agency teams by verifying standards, performing deployments, and validating data loads.
  • Support disaster recovery exercises, pre-production deployments, and ad hoc data requests.
  • Participate in agile sprint ceremonies, PI planning, and backlog refinement using JIRA and GitHub.

Education and Experience

Required:

  • Bachelor's degree or higher in Computer Science, Information Systems, Data Engineering, or a related field.
  • 3-5 years of experience in data engineering or a closely related technical role.
  • Hands-on experience with Python (PEP 8), PySpark, and SQL for ETL pipeline development.
  • Experience with AWS services including Glue, S3, MWAA (Airflow), Lambda, SNS, and SQS.
  • Familiarity with Apache Iceberg, Parquet, and ORC file formats and S3 data lake zone concepts.
  • Experience with PostgreSQL and basic familiarity with Redshift or Oracle.
  • Familiarity with Trino or Athena for query and semantic layer development.
  • Experience with CloudFormation, GitHub branching workflows, and CI/CD-integrated deployments.
  • Ability to produce clear ETL documentation, including data models (Mermaid format) and data dictionaries.
  • Understanding of ETL metadata concepts.
  • Experience in agile development environments with sprint-based delivery.
  • Experience supporting IV&V and/or User Acceptance Testing (UAT) processes in a federal or technical program environment.
  • Experience with automated testing frameworks and achieving defined code coverage thresholds.
  • Familiarity with FISMA, NIST 800-53, and OWASP ASVS Level 2 is a plus.
  • Must be able to work 8 am - 5 pm Eastern Time.
  • Active federal public trust suitability determination or ability to obtain one is required.

Equal Employment Opportunity

STS is committed to equal employment opportunity and merit-based employment practices. We provide equal opportunities to all employees and applicants and prohibit discrimination and harassment.

Apply Now

This job is active but will expire soon. Click below to apply on the company's website.

Apply for this role ↗

Share Job

Know someone who would be a perfect fit? Share this opportunity.

Job Overview

Posted6/14/2026
CategoryData Engineering
SourceJobsCollider

FAQ

Is this position remote?

The Mid-Level Data Engineer role is a remote opportunity. The location specified is Remote Worldwide.

What is the salary?

The salary is not explicitly stated, but is competitive and based on experience.

How do I apply?

You can apply by clicking the "Apply for this role" button above to submit your application on the hiring website.

Similar Opportunities

Plain Concepts

Senior Data Engineer

Plain Concepts•Spainā€¢šŸ  Remote
Competitive
Data Engineering
View Job →
M

Data Engineer

MUTT DATA•Remote Worldwideā€¢šŸ  Remote
Competitive
Data Engineering
View Job →
3

Senior Data Engineer (PowerBI & Data Integration)

3Pillar•Remote Worldwideā€¢šŸ  Remote
Competitive
Data Engineering
View Job →