Infrastructure Engineer - Meteorological Development (Contract)
Join The Weather Network and its subsidiaries (MeteoMedia, ElTiempo.es) as an Infrastructure Engineer, focusing on Meteorological Development. This contract role, extending until February 2027, is crucial for delivering highly available, scalable, and high-performance systems that support 4 billion global locations with quality data. We are recognized as #5 Most Admired Company in Canada in Legerβs 2026 Reputation Study.
Workplace Perks:
- Flexible Work Environment: π»
- Summer Hours: Start your weekends earlier during summer months. π
- Paid Vacation Days Off: In addition to Personal Days. π
- IDEAS Team: Committed to Inclusion, Diversity, Equity, Awareness, and Solidarity. βοΈ
About the Role:
This role combines infrastructure engineering, platform reliability, and software development within a modern cloud-native environment. You will design and operate distributed systems on Kubernetes (AWS EKS & GCP GKE) with Istio service mesh, AWS ECS using Docker containers, infrastructure as code (Terraform & Terragrunt), and GitOps/CI/CD pipelines (Jenkins). You'll work across AWS and/or GCP services, manage application delivery with Helm, and support stateful and caching layers like Redis (AWS ElastiCache). Ensuring system reliability through Linux-based operations and observability tooling (Prometheus, Grafana, Kibana, ElasticSearch, Jaeger, Kiali) is key. You will implement and maintain playbooks and runbooks using bash scripts and Ansible. Collaborating closely with technical leads, developers, operations, and infrastructure administrators, you will modernize and standardize infrastructure through automation, observability, and cloud/platform engineering best practices. Background in meteorology is a plus but not required; you will work closely with our Meteorology R&D team.
Day-to-Day:
- Design and implement core meteorological infrastructure services running on Kubernetes (AWS EKS) with Istio, EC2 instances, ECS Clusters, and deployed via Jenkins CI/CD pipelines using Helm, SSM, and bash scripts. Utilize AWS services like S3, SQS, SNS, FSx, ECS, EC2, Elasticache, and ALB/NLB. Improve monitoring systems using Prometheus, Grafana, and Kibana.
- Leverage AI-assisted development tools and workflows to enhance productivity, accelerate delivery, improve code quality, support troubleshooting, and streamline operational tasks.
- Provide technical advice and produce effort estimates for required work, executing tasks based on priorities.
- Collaborate with cross-functional teams to ensure systems function as intended and without impediment to existing products or processes, including the development and integration of monitoring systems.
- Work with the Technology Operations team to ensure end-to-end data monitoring from acquisition to delivery.
- Utilize knowledge of databases and database technology to create distributed, multi-tier applications capable of servicing large data volumes.
- Provide support to operational teams for production systems, ensuring service availability and stability.
- Maintain operational runbooks, automation playbooks, and reliable cloud infrastructure procedures for production systems.
What We're Hoping You Have:
- Degree or diploma in Computer Science, Engineering, Mathematics, or equivalent practical experience.
- Minimum 3+ years of experience designing, implementing, and managing containerized environments using Docker and Kubernetes across platforms like CloudStack, Proxmox, EKS & GKE.
- Strong knowledge of cloud platforms and infrastructure technologies including AWS, GCP, CloudStack, and Proxmox.
- Experience with core networking concepts (TCP/IP, NAT, DNS, load balancing, firewalls).
- Strong experience building and supporting production-grade distributed systems in Linux environments.
- Experience developing production applications in Go, Python, C/C++, or Rust.
- Strong scripting and infrastructure automation experience using Python, Bash, Terraform, Helm, Terragrunt, Groovy, or JavaScript.
- Understanding of data structures, algorithms, and performance optimization.
- Experience designing and supporting highly available, scalable systems handling large data volumes.
- Experience with relational and NoSQL databases, including caching technologies like Redis or AWS ElastiCache.
- Experience with CI/CD practices and tools (Jenkins).
- Experience building and consuming REST-based APIs and services.
- Familiarity with security best practices and secure infrastructure design.
- Experience with monitoring and observability tools (Prometheus, Grafana, ElasticSearch, Kibana, Jaeger, Kiali).
- Understanding of cloud cost optimization.
- Strong problem-solving and communication skills.
- Experienced in maintaining operational runbooks, automation playbooks, and reliable cloud infrastructure procedures.
- Experience with AI/ML infrastructure or platforms is an asset.
What Would Be Nice For You To Have:
- Keen interest in data and data mining, problem-solving.
- Experience with spatiotemporal data sets and scientific data formats (netCDF, HDF, GRIB, BUFR).
- Experience with Ansible.
- Education or experience in applying ML/AI to augment data analysis.
You Are Made For This If You:
- Have strong analytical, problem-solving, and troubleshooting skills.
- Bring curiosity and creative thinking to complex technical challenges.
- Thrive in fast-paced environments and manage priorities effectively.
- Can work independently while delivering on commitments.
- Are an engaged, collaborative team player.
- Have a natural curiosity about data, especially meteorological data.
- Possess strong written and verbal communication skills.
- Have strong organizational skills and value clear documentation.
Important Information:
- This is a contract role until February 2027.
- Candidates must be eligible to work in Canada and able to work in a Hybrid model at our Oakville, ON, Canada location.
- Pelmorex values diverse experiences and encourages applicants with alternative experiences to apply. We seek "culture add" rather than "culture fit."