This site uses cookies. To find out more, see our Cookies Policy

Manager IT Platform Operations in Reston, VA at Beacon Health Options

Date Posted: 5/14/2019

Job Snapshot

Job Description

Beacon Health Options is building next generation cloud infrastructure with a specific focus on enterprises workloads. We are looking for a seasoned Site Reliability Engineering (SRE) expert to lead initiatives focused on systems infrastructure management within a high volume, fast scaling Cloud environment. If you are currently leading a Site Reliability Engineering function in a hyper-scale cloud (IaaS, PaaS or SaaS) then we are eager to talk to you. 


  • You will serve as a technical lead for architecting and delivering all of the Operations/SRE services and processes. The job requires getting your hands dirty, troubleshooting infrastructure, and architecting data centers, using your existing knowledge and toolkits.
  • Continuously analyze the current Site Reliability capabilities and identify areas of improvements.
  • Identify, define, and implement new tools and technologies for improving the quality and efficiency of Beacon’s distributed platform.
  • You will serve as a leader and coach of a team of SREs responsible for automated infrastructure deployment, ongoing operation and monitoring of our Cloud infrastructure, working closely with the development teams.
  • You will drive reliability and supportability aspects of Cloud service, including change management, triage of customer escalations, remediation plans, playbooks and automations.


  • 10+ years of experience of increasing responsibility in data center operations roles, including team and process management responsibilities.
  • A minimum of 3+ years of experience in leading SRE/DevOps functions, preferably having been an early team member in a cloud service company that has experienced rapid growth and scale.
  • You have worked in a hyper-scale cloud (e.g. Azure, AWS, GCP), or a SaaS/PaaS company as a technical lead and architect.
  • Prior successful experience of working in an innovative, fast-paced team. The candidate must demonstrate strong entrepreneurial spirit and vigor.
  • Prior experience as a cloud-native and microservices software developer in a DevOps function for continuous integration and delivery (CI/CD) is a strong plus.
  • Prior experience designing, deploying and managing VMware SDDC platforms is a strong plus.
  • Expertise with a significant set of cloud technologies (e.g. Containers, Docker, Kubernetes, Elastic, Logstash, Kibana, Kafka, Consul, Cassandra, etc.) is a MUST.
  • Experience in cloud provisioning code development and tools (Azure Management API, GCP API, Terraform)
  • Virtualization technologies, in particular VMware product suite (vSphere, VSAN, NSX, vROps) is a strong plus.
  • Deep understanding of data center networking, including Software Defined Networking (SDN) and network architecture of Azure/AWS/GCP is a strong plus.
  • Experience operating Large-scale (Linux) production environments, preferably as part of an online service provider environment.
  • Scripting proficiency, such as Python, and data center automation tools.


  • Bachelor’s Degree in Computer Science, Engineering or a related technical discipline, or the equivalent combination of education, technical training, or work/military experience.