Browse
···
Log in / Register

Site Reliability Engineer (req-174)

$136,000-170,000/year

CATHEXIS

Tysons, VA, USA

Favourites
Share

Description

Team CATHEXIS elevates the government contracting experience through rapid response, deep skill, and thoughtful problem-solving and communication. Our core capabilities are our top-tier program and project management, data analytics, and audit services, the backbone of which is our integrated approach to operational excellence. You worked hard to get to where you are. You strive to make every day better than the day before. So do we. Team CATHEXIS operates with an all-in mindset. We are working together to create a company that supports our shared values and individual goals. Our values are centered around Respect, Engagement, Customer Service, Integrity, Teamwork, and Excellence in everything we do for our employees, clients, partners, and communities. We believe success is best when we listen and lead with empathy; model high standards of ethics to provide a rewarding candidate experience; work hard, have fun, and appreciate the strengths we all bring to the team; and empower our employees to create innovative and trusted results. We are looking for a dynamic Site Reliability Engineer (SRE) to join our team.  The Site Reliability Engineer (SRE) will manage, monitor, and optimize our clusters on Kubernetes. Together, we’re accelerating our clients’ digital transformation through the building and deployment of data-driven, scalable AI solutions.  The ideal candidate will have a deep understanding of Kubernetes, Cloud Infrastructure, and Infrastructure as Code (IaC) practices. You will be responsible for ensuring the reliability and scalability of our Kubernetes clusters and Cloud Infrastructure. Responsibilities: Monitor and Manage Kubernetes Clusters: Ensure the stability, health, and scalability of Kubernetes Clusters, deploying applications and services on Kubernetes Kubernetes Management: Deploy, monitor, and scale applications on Kubernetes clusters. Maintain Helm charts, manage services, and ensure resource allocation for optimal cluster performance Cloud Infrastructure Management: Work with leading Cloud Platforms (AWS, GCP, Azure) to set up, configure, and manage infrastructure resources using Infrastructure as Code (Terraform, CloudFormation, etc.) Monitoring & Incident Response: Set up monitoring solutions, define alerts, and manage the incident response process for any issues related to Jenkins, or Kubernetes clusters Automate Infrastructure Processes: Build automation tools for scaling, monitoring, and maintaining infrastructure using modern tools like Terraform, Ansible, or equivalent Collaborate Across Teams: Work closely with development, services, and operations teams to ensure a seamless integration between application development and infrastructure Security & Compliance: Ensure all systems follow best practices in terms of security and compliance with relevant regulations. This includes role-based access, encryption, and automated vulnerability scanning Requirements: Active Secret Clearance is required Bachelor’s degree (or equivalent) in computer science or related discipline A minimum of two(2) years of experience working with on-premise and off-premise cloud environments Experience with AWS, Azure and / or GCP Ability to program (structured and OOP) using one or more high-level languages, such as Python, Java, C/C++, Ruby, and JavaScript Experience with distributed storage technologies such as NFS, HDFS, Ceph, and Amazon S3, as well as dynamic resource management frameworks (Apache Mesos, Kubernetes, Yarn) Proactive approach to identifying problems, performance bottlenecks, and areas for improvement Agile/Scrum experience CATHEXIS offers competitive compensation packages to all eligible employees. Our goal is to provide a compensation package that reflects the value you bring to our team, is competitive with market rates, and promotes your financial security and personal well-being. The annual salary range for this role is $136,000 - $170,000. Please note that the salary information provided is a general guideline. CATHEXIS considers various factors in its final offer, including location, qualifications, experience, and skills.  CATHEXIS is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. If you are an individual with a disability and would like to request a reasonable accommodation as part of the employment selection process, please contact the Recruiting@cathexiscorp.com.

Source:  workable View original post

Location
Tysons, VA, USA
Show map

workable

You may also like

Workable
Data Center Technician - Ellendale, ND - relocation expenses provided
Datacenter Hardware Technician Ellendale, ND (100% onsite) - FYI - this position wouldn't start until early December 2025. We can provide relocation assistance. This will be first shift however it will occasionally rotate (8 hours per day, 40 hours a week). Must be flexible to work varying shifts. Salary: 80-90K (pending experience) Overview: We are seeking a Datacenter Hardware Technician to support and maintain Dell server infrastructure in a high-demand, fast-paced environment. This role involves hands-on hardware troubleshooting, repairs, and installations (rack and stack), with a focus on maintaining uptime and efficiency. The ideal candidate is detail-oriented, physically capable, and comfortable working onsite within a team-based datacenter setting. Key Responsibilities: Hardware Maintenance & Repair: Perform break/fix services on Dell servers, including the replacement of components such as GPUs, NICs, memory, and other hardware. Ticket Management: Track, prioritize, and resolve hardware-related service tickets in a timely and efficient manner. Troubleshooting & Diagnostics: Identify and resolve hardware issues using knowledge of server architecture and components. Customer Service & Communication: Maintain clear, professional communication with team members and internal stakeholders to ensure smooth operations. Physical Datacenter Work: Lift up to 65 pounds, climb ladders, and carry out tasks in a physically demanding datacenter environment. Team Collaboration: Work closely with fellow technicians and other departments to meet deployment and maintenance goals. Quality & Precision: Ensure all tasks and repairs are performed to a high standard of accuracy and reliability. Additional Information: Must be a US citizen. This position requires 100% onsite presence in Ellendale, ND Shift is expected to be first shift, but will probably rotate often. Must be able to work varying shift. Role involves physical labor in a dynamic datacenter environment Requirements Dell Server Expertise: Proven experience racking, stacking, and servicing Dell servers in a datacenter setting. Break/Fix Proficiency: Hands-on experience diagnosing hardware issues and performing part replacements. Troubleshooting Skills: Strong problem-solving abilities and technical insight into server operations. Customer Focus: Excellent communication skills with a professional, customer-first approach. Physical Capability: Ability to lift heavy equipment (up to 65 lbs) and work on ladders as needed. Detail-Oriented: Committed to delivering high-quality work with strong attention to detail. Preferred Qualifications: Experience with Nvidia GPUs/NICs Basic understanding of networking concepts and troubleshooting Ability to read and interpret Linux logs for diagnostics Nice-to-Have Skills: Familiarity with the Linux command line (CLI) Exposure to RoCE (RDMA over Converged Ethernet) networking Benefits Our comprehensive benefits package for full-time salaried employees is effective immediately upon the start date. Benefits include comprehensive PPO medical coverage with access to a Health Savings Account (HSA) option, a vision plan, and dental insurance with the base dental plan option paid for by PGTEK. A TRICARE Supplemental Medical Insurance plan is also available.  Life Insurance, Short and Long-Term disability, and Critical Illness insurance have premiums covered. Additionally, PGTEK offers a matching 401(k) plan and a discount on pet insurance through ASPCA Pet Insurance. An Employee Assistance Program is available at no cost to all employees. We offer a generous amount of PTO and Holidays, and an Education Assistance Program is available after 12 months of employment. About PGTEK: PGTEK is a true consulting organization dedicated to helping clients achieve their business and technology objectives utilizing our decades of experience and business relationships. PGTEK invests in the educational advancements of our staff by providing the necessary resources to complete Professional and Business Certifications. Our company is our people, and we treat them like family.  EOE, including disability/veterans.
Ellendale, ND 58436, USA
$80,000-90,000/year
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.