Machine Learning Engineer, ML Runtime & Optimization

$140,000-250,000/year

pony.ai

Fremont, CA, USA

Description

Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services across a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale.

Pony.ai’s leading position has been recognized, with CNBC ranking Pony.ai #10 on its Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai has accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public on NASDAQ in November 2024.

Responsibilities

The ML Infrastructure team at Pony.ai provides a set of tools to support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will develop technologies to accelerate the training and inference of AI models in autonomous driving systems. This includes:

• Identifying key applications for current and future autonomous driving problems and performing in-depth analysis and optimization to ensure the best possible performance on current and next-generation compute architectures.
• Collaborating closely with diverse groups at Pony.ai, including both hardware and software teams, to optimize and craft core parallel algorithms and to influence the next-generation compute platform architecture design and software infrastructure.
• Applying model optimization and efficient deep learning techniques to models and optimized ML operator libraries.
• Working across the entire ML framework/compiler stack (e.g., Torch, CUDA and TensorRT) and on system-efficient deep learning models.

Requirements

• BS/MS or Ph.D. in computer science, electrical engineering or a related discipline.
• Strong programming skills in C/C++ or Python.
• Experience with model optimization, quantization or other efficient deep learning techniques.
• Good understanding of hardware performance, including the CPU or GPU execution model, threads, registers, cache, cost/performance trade-offs, etc.
• Experience with profiling, benchmarking and validating performance for complex computing architectures.
• Experience in optimizing the utilization of compute resources, and in identifying and resolving compute and data flow bottlenecks.
• Strong communication skills and the ability to work cross-functionally between software and hardware teams.

Preferred Qualifications

Experience in one or more of the following areas is preferred:

• Experience with parallel programming, ideally CUDA, OpenCL or OpenACC.
• Experience in computer vision, machine learning and deep learning.
• Strong knowledge of software design, programming techniques and algorithms.
• Good knowledge of common deep learning frameworks and libraries.
• Deep knowledge of system performance, GPU optimization or ML compilers.

Compensation and Benefits

Base Salary Range: $140,000 - $250,000 annually

Compensation may vary outside of this range depending on many factors, including the candidate’s qualifications, skills, competencies, experience, and location. Base pay is one part of the total compensation, and this role may be eligible for bonuses/incentives and restricted stock units.

We also provide the following benefits to eligible employees:

• Health Care Plan (Medical, Dental & Vision)
• Retirement Plan (Traditional and Roth 401k)
• Life Insurance (Basic, Voluntary & AD&D)
• Paid Time Off (Vacation & Public Holidays)
• Family Leave (Maternity, Paternity)
• Short Term & Long Term Disability
• Free Food & Snacks

Source: Workable

© 2025 Servanan International Pte. Ltd.