Observability Platform Engineering Technical Lead
Job Description
Note: Fidelity will not provide immigration sponsorship for this position.
The Role
We are seeking a highly experienced, hands-on technical leader and platform builder. You will report to the engineering lead and will be responsible for delivering core platform capabilities within Fidelity's Enterprise Observability Platform. You'll own system architecture, observability, security, and delivery while collaborating closely with product owners and engineering teams.
What you'll do (responsibilities):
- Design and implement highly scalable, fault-tolerant distributed systems and services that operate at large scale and low latency.
- Lead the architecture and build of high-throughput data streaming pipelines (real-time/event streaming), including ingestion, processing, and durable storage.
- Develop and own observability for systems: metrics, tracing, structured logging, dashboards, alerts and SLOs.
- Implement application instrumentation and collaborate with platform teams to standardize telemetry and monitoring practices.
- Use AWS native services (MSK, Lambda, ECS/EKS, EC2, S3, DynamoDB, RDS, IAM, CloudWatch, etc.) to deliver robust solutions.
- Design and enforce secure networking and firewall architectures (VPCs, subnets, security groups, NACLs, load balancers, private endpoints).
- Write production-quality code and tests in at least one major language (Python, Java, Go, or Node.js). Own CI/CD pipelines and release automation.
- Drive projects end-to-end: translate product requirements, create execution plans, identify risks, and coordinate across product, security, infra, and operations teams.
- Mentor engineers, run design reviews, and improve team practices around reliability, scalability, and operability.
The Expertise and Skills You Bring:
- Strong understanding of distributed systems fundamentals: consensus, partitioning, replication, consistency models, leader election, backpressure, and fault tolerance.
- Demonstrated experience designing and operating highly available, highly scalable production systems.
- Deep knowledge of observability: metrics, tracing, logging formats (especially OpenTelemetry), alerting, and SLO/SLI design.
- Experience implementing application instrumentation libraries and sidecars; familiarity with sampling, tagging, and context propagation.
- Solid networking knowledge: TCP/IP, load balancing, NAT, DNS, VPC design, security groups, firewalls, and TLS.
- Proven experience building solutions using AWS native services (design patterns and tradeoffs).
- Experience designing and building real-time/high-speed streaming pipelines capable of processing large volumes of data; familiarity with Kafka, Kinesis, Flink, Spark Streaming, or similar.
- Hands-on coding: able to implement, debug, and ship production code; strong test discipline and experience with CI/CD.
- Experience with building containerized applications using Docker and container orchestration (Kubernetes/EKS).
- Excellent written and verbal communication; able to drive projects with product managers and stakeholders.
Nice-to-have
- Experience with observability platforms like Grafana, Prometheus, Datadog, or Splunk.
- Background in performance tuning, JVM internals, or low-latency systems.
- Experience with infrastructure-as-code (Terraform, CloudFormation).
- Experience building multi-region or global systems.
Certifications:
Category:
Information Technology
Please be advised that Fidelity's business is governed by the provisions of the Securities Exchange Act of 1934, the Investment Advisers Act of 1940, the Investment Company Act of 1940, ERISA, numerous state laws governing securities, investment and retirement-related financial activities and the rules and regulations of numerous self-regulatory organizations, including FINRA, among others. Those laws and regulations may restrict Fidelity from hiring and/or associating with individuals with certain criminal histories.
About Fidelity Investments
Since our founding in 1946, Fidelity has been dedicated to strengthening and securing our clients’ financial well-being through exceptional service and innovative solutions. We empower over 50 million people to achieve their most important financial goals, manage employee benefit programs for nearly 24,000 businesses, and support more than 16,000 wealth management firms and institutions with cutting-edge investments and technology. Our diverse business portfolio and independence provide us with a comprehensive view of the market and the stability to deliver long-term value for our customers. As the financial industry evolves and customer needs grow more complex, Fidelity continues to reinvent, innovate, and transform to meet the challenges of tomorrow’s financial landscape.
*Specifically serviced by our Clearing & Custody team within Fidelity Institutional
Fidelity TalentSource is the in-house temporary staffing provider for Fidelity Investments. Unlike traditional staffing agencies, we are an internal business unit within Fidelity’s Talent Acquisition team, dedicated to recruiting talent from various backgrounds for roles in Fidelity’s regional and investor center locations. Our mission is to help you experience Fidelity’s diverse and inclusive workplace while expanding your skill set and professional network, with the ultimate goal of conversion to full-time employment as part of Fidelity’s long-term strategy. To learn more about temporary positions at Fidelity Investments, visit FidelityTalentSource.com.