Job Summary
The Comcast Cloud Team is seeking a Site Reliability Engineer (SRE), Engineer 2, to maintain, improve reliability, support, and operate our multi-region OpenStack Private Cloud environment using open-source technologies. Our platform offers virtual machines, Block Storage, Object Storage, and provide IaaS private cloud services that complement our comprehensive hybrid cloud strategy. Our OpenStack based IaaS platform is comprised of thousands of Linux hosts running tens of thousands of virtual machines. Bring your expertise in Linux systems engineering, virtualization, networking, automation, performance tuning, and troubleshooting along with a desire to solve large scale problems. Security first mentality is a must.Job Description
The Cloud Technologies Private Cloud SRE team supports private cloud platforms that internally rival public clouds in usage,performanceand efficiency. Essential to our mission is tomaximize availability, performance, andcapacity utilizationof ourplatforms.We use tools that build operational awareness for ourtenantsand our engineering teams,we scale through automation, and we continually improveour platformsin supportto provide enhanced capabilities to our tenants.
The successful candidate will implement and support operational and reliability aspects of our OpenStack infrastructure, ensuring high availability, scalability, and performance. Our ideal candidate will have extensive Linux operating system administration, possess a proficiency in Infrastructure as Code (IaC) using tools such as Ansible, Git, Terraform, Kubernetes Operators, and Python scripting language. Must have knowledge and familiarity with operational best practices, monitoring tools (Prometheus, Grafana, ELK, fishymetrics) and demonstrate a deep understanding of storage, network services, architecture, distributed services, networks and protocols, and virtualization. Experience with OpenStack, Kubernetes and Ceph storage technologies, and experience in CyberSecurity is a plus. There will be an expectation to support migration/maintenance work during off-business hours and be part of an on-call rotation with the team.
This role can be based in Philadelphia, PA; Englewood, CO; Reston, VA; or Austin, TX. It is not approved for remote or Virtual employment. We are unable to provide sponsorship for this role now or in the future.
CoreDuties and Responsibilities
- Monitor, optimize, and troubleshoot OpenStack services and infrastructure hardware to ensure high availability, performance, and security
- Stay abreast of industry trends, emerging technologies, and best practices in cloud computing and OpenStack ecosystem
- Develop and implement Infrastructure as Code (IaC) using tools such as Ansible, Terraform, and Git, ensuring automation and repeatability
- Document architecture, configurations, procedures, and troubleshooting steps for reference and knowledge sharing
- Anticipates and Interprets customer needs, assesses requirements, and identifies solutions based on best practices
- Solves complex problems, with a broad perspective to identify innovative solutions via automations
- Ensures that system failures are restored in a timely manner
- Participates in the review of failures and provide feedback to prevent future occurrences
- Consistent exercise of independent judgment and discretion in matters of significance
- On-Call support as required
Required Qualifications
- 2-4 years designing, building, and operating OpenStack environments supporting high availability (99.99% availability), low latency enterprise applications
- 3+ years proficiency in automation and DevOps tools such as Ansible, AWX, Terraform, GitHub
- 3+ years proficiency with IP Networks and networking design and operations
- 3+ years hands on experience with server hardware deployments, maintenance, and troubleshooting
- 3+ years of strong Linux administration skills
Preferred Qualifications
- Prior experience withCEPHstorage.
- Strong experience writing software that can engage with and consume data from other systems using APIs and SDKs
- Prior experience and knowledge of git and peer-review workflow
- Prior contributions to open-source software
- Past software engineering projects you can show - e.g.GitHub portfolio
- Willingness to learn by asking questions and showing initiative to learn
- Ability to actively participatein team meetings, providingaccurateand complete status updates
Employees at all levels are expected to:
- Understand our Operating Principles; make them the guidelines for how you do your job.
- Own the customer experience - think and act in ways that put our customers first, give them seamless digital options at every touchpoint, and make them promoters of our products and services.
- Know your stuff - be enthusiastic learners, users and advocates of our game-changing technology, products and services, especially our digital tools and experiences.
- Win as a team - make big things happen by working together and being open to new ideas.
- Be an active part of the Net Promoter System - a way of working that brings more employee and customer feedback into the company - by joining huddles, making call backs and helping us elevate opportunities to do better for our customers.
- Drive results and growth.
- Respect and promote inclusion & diversity.
- Do what's right for each other, our customers, investors and our communities.
Disclaimer:
- This information has been designed to indicate the general nature and level of work performed by employees in this role. It is not designed to contain or be interpreted as a comprehensive inventory of all duties, responsibilities and qualifications.
Comcast is proud to be an equal opportunity workplace. We will consider all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, genetic information, or any other basis protected by applicable law.
Skills:
OpenStack; Virtualization; Linux; Cloud Computing
Salary:
Pay Range: This job can be performed in Denver Campus with a Pay Range of $76,222.44 - $125,222.58
Comcast intends to offer the selected candidate base pay within this range, dependent on job-related, non-discriminatory factors such as experience. The application window is 30 days from the date job is posted, unless the number of applicants requires it to close sooner or later.
The application window is 30 days from the date job is posted, unless the number of applicants requires it to close sooner or later.
Base pay is one part of the Total Rewards that Comcast provides to compensate and recognize employees for their work. Most sales positions are eligible for a Commission under the terms of an applicable plan, while most non-sales positions are eligible for a Bonus. Additionally, Comcast provides best-in-class Benefits to eligible employees. We believe that benefits should connect you to the support you need when it matters most, and should help you care for those who matter most. That's why we provide an array of options, expert guidance and always-on tools, that are personalized to meet the needs of your reality - to help support you physically, financially and emotionally through the big milestones and in your everyday life. Please visit the compensation and benefits summary on our careers site for more details.
Education
Bachelor's Degree
While possessing the stated degree is preferred, Comcast also may consider applicants who hold some combination of coursework and experience, or who have extensive related professional experience.
Relevant Work Experience
2-5 Years