
Senior Site Reliability Engineer

$140,000-180,000/year

TP-Link Systems Inc.

Irvine, CA, USA


Description

At the forefront of the future of connected living, TP-Link Systems Inc.'s R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generation networking, IoT smart home products, and software services. Our team of passionate engineers is constantly innovating, engineering solutions that transform the end-user experience with simpler, smarter, and more reliable connectivity. We're looking for a passionate and experienced Senior Site Reliability Engineer to join our team and play a crucial role in ensuring our cloud platform's security, reliability, scalability, and operational excellence.

About Us:
Headquartered in the United States, TP-Link Systems Inc. is a global provider of reliable networking devices and smart home products, consistently ranked as the world’s top provider of Wi-Fi devices. The company is committed to delivering innovative products that enhance people’s lives through faster, more reliable connectivity. With a commitment to excellence, TP-Link serves customers in over 170 countries and continues to grow its global footprint.
We believe technology changes the world for the better! At TP-Link Systems Inc., we are committed to crafting dependable, high-performance products to connect users worldwide with the wonders of technology. Embracing professionalism, innovation, excellence, and simplicity, we aim to assist our clients in achieving remarkable global performance and enable consumers to enjoy a seamless, effortless lifestyle.

Responsibilities:
Serve as technical SME for implementing and operating microservices on Kubernetes cloud-based platforms.
Collaborate with the Cloud Technical Development and DevOps teams to deploy services to the multi-cloud platform.
Perform load tests and chaos tests to ensure the scalability and reliability of microservices.
Build observability for microservices and cloud platforms such as AWS, OCI, Azure, and GCP.
Write and execute disaster recovery plans in collaboration with the development and DevOps teams.
Analyze and resolve production risks caused by insufficient resources, such as node groups, CPU, memory, HPA scheduling, and JVM pre-warming.
Write and maintain scripts for automation using languages like Python, Go, or Bash.
Define and maintain the KPIs (SLA/SLO/SLI) for all cloud microservices with development teams to better understand the business.
Create and maintain technical documentation, including architecture diagrams, design documents, and standard operating procedures.
Ensure adherence to security and compliance standards, including ISO 27001, SOC 2, and GDPR.
Lead incident response efforts to troubleshoot and resolve production issues quickly.
Perform post-incident analysis to identify root causes and potential workarounds or solutions.
Assist with product and technology selection, including implementation of POCs.
Be fluid and open to change and to evolving processes and tools.
Help mentor and train less senior members of the team.
Participate in the on-call rotation and provide support after work hours and on weekends.
Other duties as assigned.

Requirements:
Bachelor's degree in Computer Science, Information Technology, or a related field.
5+ years of experience as a Site Reliability Engineer.
Proficiency in programming and scripting languages like Java, Python, Bash, or PowerShell.
Hands-on experience in SRE, DevOps, cloud operations, and cloud security best practices.
Strong knowledge of security technologies, including identity and access management, network security, application security, and data protection.
Strong problem-solving and analytical skills, with the ability to work independently and as part of a team.
Experience in developing and maintaining technical documentation and implementing compliance requirements.
Additional Skills (Preferred):
Expert-level cloud certifications such as AWS Solutions Architect Professional, Azure Solutions Architect Expert, and GCP Professional Cloud Architect.
Experience with container orchestration technologies (e.g., Kubernetes).

Benefits:
Salary range: $140,000 - $180,000
Free snacks and drinks, and provided lunch on Fridays
Fully paid medical, dental, and vision insurance (partial coverage for dependents)
Contributions to 401k funds
Bi-annual reviews and annual pay increases
Health and wellness benefits, including free gym membership
Quarterly team-building events

At TP-Link Systems Inc., we are continually searching for ambitious individuals who are passionate about their work. We believe that diversity fuels innovation and collaboration and drives our entrepreneurial spirit. As a global company, we highly value diverse perspectives and are committed to cultivating an environment where all voices are heard, respected, and valued. We are dedicated to providing equal employment opportunities to all employees and applicants, and we prohibit discrimination and harassment of any kind based on race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. Beyond compliance, we strive to create a supportive and growth-oriented workplace for everyone. If you share our passion and connection to this mission, we welcome you to apply and join us in building a vibrant and inclusive team at TP-Link Systems Inc.

Please, no third-party agency inquiries. We are unable to offer visa sponsorship at this time.

Source: Workable


You may also like

Craigslist
Client Liaison (Remote Position)
Making a difference: Digital Cheetah has provided cutting-edge volunteer and member management solutions to some of the largest not-for-profits (https://www.digitalcheetah.com/clients/) in the world for 24 years. We strive to create an exciting, challenging, and rewarding work environment. Join a team of dedicated industry veterans with vast experience at the forefront of technology innovation to build software with a purpose.
Description: This is a full-time, remote position open for immediate hire. As a member of our implementations team, you will work with external clients and internal team members to make a real difference for leading non-profits throughout the world. Following an agile methodology, you will manage the account relationship with clients through an interactive requirements-gathering process, project planning, monitoring project workflow, and managing client expectations and scope. You will have exposure to a wide variety of roles, challenges, and technologies and will have the opportunity to learn best-practice skills in an expanding company with numerous possibilities for personal and career advancement.
Remote Position: Our team works remotely using Slack, Zoom, Microsoft Teams, JIRA, Confluence, and other collaboration tools.
Job Responsibilities:
Handle day-to-day project management to ensure on-time delivery by applying Agile theory, practices, and rules in a matrix-managed scrum environment.
Manage internal and external communication, improve transparency, and effectively disseminate information between teams.
Facilitate and perform requirements gathering and discovery for new projects.
Build an efficient and trusting scrum environment with an emphasis on problem solving; facilitate discussion, conflict resolution, decision making, and getting the work done.
Manage client expectations by building relationships; communicating project status and open issues; preparing reports; conducting planning and retrospective meetings; discovering future feature enhancements.
Implement solutions by monitoring project progress; tracking action items; conducting design and implementation reviews; examining, researching, and resolving issues; escalating issues to designated personnel; identifying work process improvements.
Assist in removing roadblocks and impediments, and ensure work is proceeding according to schedule.
Generate development tickets in JIRA for feature requests, bug fixes, and enhancements from our clients to our internal development team.
Manage the UAT process with the client, including troubleshooting UAT findings and working with appropriate resources to resolve them.
Facilitate a seamless handoff to the product support team once a project launches.
Participate in the following sprint ceremonies: Release Planning, Daily Scrum, Sprint Planning, Sprint Demo, and Retrospectives.
Provide product advice, best-practice consulting, and product demonstrations to clients.
Job Qualifications:
Project Management, Client Relationship Management, General Consulting Skills, Presenting Technical Information, Technical Understanding, Teamwork, Problem Solving.
Manage time well and handle multiple projects simultaneously.
High attention to detail.
Flexible and open to learning new things.
Solid client communication required.
Work well in a collaborative team environment.
Comfortable using technology and explaining it to others.
Creative and efficient troubleshooting and problem-solving skills are a must.
Familiarity with an Agile development environment is a plus.
Apply Online: https://www.digitalcheetah.com/client-liaison/
1101 Fieldcrest Dr, Austin, TX 78704, USA
Negotiable Salary
Workable
Sr GCP Engineer
Infrastructure Automation & Management:
Design, implement, and maintain scalable, reliable, and secure cloud infrastructure using GCP services.
Automate cloud infrastructure provisioning, scaling, and monitoring using Infrastructure as Code (IaC) tools such as Terraform or Google Cloud Deployment Manager.
Manage and optimize GCP resources such as Compute Engine, Kubernetes Engine, Cloud Functions, and BigQuery to support development teams.
CI/CD Pipeline Management:
Build, maintain, and enhance continuous integration and continuous deployment (CI/CD) pipelines to ensure seamless and automated code deployment to GCP environments.
Integrate CI/CD pipelines with GCP services like Cloud Build and Cloud Source Repositories, or with third-party tools like Jenkins.
Ensure pipelines are optimized for faster build, test, and deployment cycles.
Monitoring & Incident Management:
Implement and manage cloud monitoring and logging solutions using Dynatrace and GCP-native tools like Stackdriver (Monitoring, Logging, and Trace).
Monitor cloud infrastructure health and resolve performance issues, ensuring minimal downtime.
Set up incident management workflows, implement alerting mechanisms, and create runbooks for rapid issue resolution.
Security & Compliance:
Implement security best practices for cloud infrastructure, including identity and access management (IAM), encryption, and network security.
Ensure GCP environments comply with organizational security policies and industry standards such as GDPR, CCPA, or PCI-DSS.
Conduct vulnerability assessments and perform regular patching and system updates to mitigate security risks.
Collaboration & Support:
Collaborate with development teams to design cloud-native applications that are optimized for performance, security, and scalability on GCP.
Work closely with cloud architects to provide input on cloud design and best practices for continuous integration, testing, and deployment.
Provide day-to-day support for development, QA, and production environments, ensuring availability and stability.
Cost Optimization:
Monitor and optimize cloud costs by analyzing resource utilization and recommending cost-saving measures such as right-sizing instances, using preemptible VMs, or implementing auto-scaling.
Tooling & Scripting:
Develop and maintain scripts (using languages like Python, Bash, or PowerShell) to automate routine tasks and system operations.
Use configuration management tools like Ansible, Chef, or Puppet to manage cloud resources and maintain system configurations.
Required Qualifications & Skills:
Experience:
3+ years of experience as a DevOps Engineer or Cloud Engineer, with hands-on experience in managing cloud infrastructure.
Proven experience working with Google Cloud Platform (GCP) services such as Compute Engine, Cloud Storage, Kubernetes Engine, Pub/Sub, Cloud SQL, and others.
Experience in automating cloud infrastructure with Infrastructure as Code (IaC) tools like Terraform, Cloud Deployment Manager, or Ansible.
Technical Skills:
Strong knowledge of CI/CD tools and processes (e.g., Jenkins, GitLab CI, CircleCI, or GCP Cloud Build).
Proficiency in scripting and automation using Python, Bash, or similar languages.
Strong understanding of containerization technologies (Docker) and container orchestration tools like Kubernetes.
Familiarity with GCP networking, security (IAM, VPC, Firewall rules), and monitoring tools (Stackdriver).
Cloud & DevOps Tools:
Experience with Git for version control and collaboration.
Familiarity with GCP-native DevOps tools like Cloud Build, Cloud Source Repositories, Artifact Registry, and Binary Authorization.
Understanding of DevOps practices and principles, including Continuous Integration, Continuous Delivery, Infrastructure as Code, and Monitoring/Alerting.
Security & Compliance:
Knowledge of security best practices for cloud environments, including IAM, network security, and data encryption.
Understanding of compliance and regulatory requirements related to cloud computing (e.g., GDPR, CCPA, HIPAA, or PCI).
Soft Skills:
Strong problem-solving skills with the ability to work in a fast-paced environment.
Excellent communication skills, with the ability to explain technical concepts to both technical and non-technical stakeholders.
Team-oriented mindset with the ability to work collaboratively with cross-functional teams.
Certifications (Preferred):
Google Professional Cloud DevOps Engineer certification.
Other GCP certifications such as Google Professional Cloud Architect or Associate Cloud Engineer are a plus.
DevOps certifications like Certified Kubernetes Administrator (CKA) or an AWS/GCP DevOps certification are advantageous.
Springfield, MO, USA
Negotiable Salary
Workable
Machine Learning Engineer
Tiger Analytics is an advanced analytics consulting firm. We are the trusted analytics partner for several Fortune 100 companies, enabling them to generate business value from data. Our consultants bring deep expertise in Data Science, Machine Learning, and AI. Our business value and leadership have been recognized by various market research firms, including Forrester and Gartner.
Are you a Machine Learning Engineer with expertise in Google Cloud Platform (GCP) and Vertex AI? We are looking for two talented professionals to join our team in a fully remote, onshore capacity. If you thrive in building and deploying scalable AI solutions, this role is for you!
What You'll Do:
Collaborate with cross-functional teams to design and deploy ML models.
Develop reusable, scalable code for AI/ML applications.
Leverage GCP services to build end-to-end machine learning pipelines.
Optimize models for performance and scalability using Vertex AI.
Key Requirements:
Google Cloud Platform (GCP) Experience: Strong proficiency in GCP services, including data engineering and machine learning tools.
Google Vertex AI Expertise: Hands-on experience with model training, deployment, and optimization using Vertex AI.
Model Development & Deployment: Proven ability to design, build, and productionize machine learning models.
API Development: Skilled in developing robust APIs for seamless integrations.
Python Programming with CI/CD: Experience in Python-based applications and implementing CI/CD pipelines.
Why Join Us?
Work remotely while contributing to cutting-edge projects.
Collaborate with a dynamic team passionate about AI/ML innovation.
Opportunity to work with the latest Google Cloud technologies.
Ready to take the next step? Apply now and be part of a team that’s shaping the future of AI!
Benefits:
Significant career development opportunities exist as the company grows. The position offers a unique opportunity to be part of a small, fast-growing, challenging, and entrepreneurial environment, with a high degree of individual responsibility.
Dallas, TX, USA
Negotiable Salary
Craigslist
Software Engineer (Redwood City)
NimbleRx, Inc. in Redwood City, California seeks a Software Engineer.
Responsibility: Responsible for designing, building, and implementing software solutions to optimize and scale the Nimble platform and related technologies. Duties include: developing highly scalable software using innovative computer science and software engineering principles; creating product features and designing easy-to-use APIs, systems, and tools while ensuring proper protocols are in place to rapidly roll out upgrades and new features; developing a user-friendly interface using React, Java Spring, and PostgreSQL; building internal tools to support performance, reliability, and scalability; and other duties as assigned.
Salary range: $162,000 - $182,000. Eligible for standard benefits. Employee may be stationed anywhere within commuting distance of our Redwood City, CA office and will work in office 2 days per week.
Education: Master’s degree in Computer Science, Software Engineering, or related field (or foreign equivalent).
Requirements: 1 year of experience in the job offered or related. Other special requirements include: 1 year of experience working with Java, Ruby, Python, or similar programming languages; 1 year of experience developing APIs, services, and platforms; 1 year of experience developing user-facing applications using React, Flutter, or similar; and 1 year of experience working with distributed codebases on cross-functional teams.
Please mail resumes to Attn: Eva Lee, 2317 Broadway, Redwood City, CA 94063, quoting job #NIMSE25.
691 Winslow St, Redwood City, CA 94063, USA
$162,000-182,000/year
© 2025 Servanan International Pte. Ltd.