Browse
···
Log in / Register

Machine Learning Engineer, ML Runtime & Optimization

$140,000-250,000/year

pony.ai

Fremont, CA, USA

Favourites
Share

Description

Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services at a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai’s leading position has been recognized, with CNBC ranking Pony.ai #10 on its CNBC Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai has accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public at NASDAQ in Nov. 2024. Responsibility The ML Infrastructure team at Pony.ai provides a set of tools to support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to accelerate the training and inferences of the AI models in autonomous driving systems. This includes: Identifying key applications for current and future autonomous driving problems and performing in-depth analysis and optimization to ensure the best possible performance on current and next-generation compute architectures. Collaborating closely with diverse groups in Pony.ai including both hardware and software to optimize and craft core parallel algorithms as well as to influence the next-generation compute platform architecture design and software infrastructure. Apply model optimization and efficient deep learning techniques to models and optimized ML operator libraries. Work across the entire ML framework/compiler stack (e.g.Torch, CUDA and TensorRT), and system-efficient deep learning models. Requirements BS/MS or Ph.D in computer science, electrical engineering or a related discipline. Strong programming skills in C/C++ or Python. Experience on model optimization, quantization or other efficient deep learning techniques Good understanding of hardware performance, regarding CPU or GPU execution model, threads, registers, cache, cost/performance trade-off, etc. Experience with profiling, benchmarking and validating performance for complex computing architectures. Experience in optimizing the utilization of compute resources, identifying and resolving compute and data flow bottlenecks. Strong communication skills and ability to work cross-functionally between software and hardware teams Preferred Qualifications: One or more of the following fields are preferred Experience with parallel programming, ideally CUDA, OpenCL or OpenACC. Experience in computer vision, machine learning and deep learning. Strong knowledge of software design, programming techniques and algorithms. Good knowledge of common deep learning frameworks and libraries. Deep knowledge on system performance, GPU optimization or ML compiler. Compensation and Benefits Base Salary Range: $140,000 - $250,000 Annually Compensation may vary outside of this range depending on many factors, including the candidate’s qualifications, skills, competencies, experience, and location. Base pay is one part of the Total Compensation and this role may be eligible for bonuses/incentives and restricted stock units. Also, we provide the following benefits to the eligible employees: Health Care Plan (Medical, Dental & Vision) Retirement Plan (Traditional and Roth 401k) Life Insurance (Basic, Voluntary & AD&D) Paid Time Off (Vacation & Public Holidays) Family Leave (Maternity, Paternity) Short Term & Long Term Disability Free Food & Snacks

Source:  workable View original post

Location
Fremont, CA, USA
Show map

workable

You may also like

Workable
Freelance Software Developer (C/C++ - Rust) - AI Trainer
This opportunity is only for candidates currently residing in the specified country. Your location may affect eligibility and rates. At Mindrift, innovation meets opportunity. We believe in using the power of collective intelligence to ethically shape the future of AI. What we do The Mindrift platform connects specialists with AI projects from major tech innovators. Our mission is to unlock the potential of Generative AI by tapping into real-world expertise from across the globe. About the Role GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.  Although every project is unique, you might typically: Analyze and understand existing code in Python or C/C++ Migrate logic to idiomatic, safe Rust while preserving functionality Adapt or port the test suite and ensure behavioral equivalence Document migration steps and technical decisions How to get started Simply apply to this post, qualify, and get the chance to contribute to projects aligned with your skills, on your own schedule. From creating training prompts to refining model responses, you’ll help shape the future of AI while ensuring technology benefits everyone. Requirements You have a Bachelor's or Master’s degree in Software Development, Computer Science, or other related fields.  You have at least 3 years of professional experience with C/C++ and 1+ year of hands-on experience with Rust. You are experienced with FFI tools (bindgen, cxx) and unsafe Rust for C/C++interoperability. You bring experience testing migrated code (unit/integration/fuzz tests). You demonstrate solid understanding of systems programming (memory management, concurrency). You are skilled at refactoring legacy code and documenting migration steps. Prompt engineering experience is a strong plus. Your level of English is advanced (C1) or above. You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines. Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge. Benefits Why this freelance opportunity might be a great fit for you? Get paid for your expertise, with rates that can go up to $50/hour depending on your skills, experience, and project needs. Take part in a part-time, remote, freelance project that fits around your primary professional or academic commitments. Work on advanced AI projects and gain valuable experience that enhances your portfolio. Influence how future AI models understand and communicate in your field of expertise.
New York, NY, USA
$50
Craigslist
Enroll in the Software Boot Camp Online Today and Land a Tech Job
The Tech Academy delivers cost-effective and self-paced online coding boot camps that are tailored for beginners with no prior technical or coding knowledge. Our certification programs thoroughly cover in-demand skills for the tech industry, are endorsed by stellar online reviews and designed to fit around your personal schedule to prepare you for your tech career with a well-rounded tool kit. We have been offering thorough, budget-friendly, flexible, and trusted coding boot camps for over a decade. Founded in 2014, The Tech Academy specializes in certifying students in a wide range of technical specialties, including: AI, coding, cybersecurity, data science, app development, design, web development, and more. After your coding boot camp, our job placement specialists will provide you with career guidance. Our team has successfully placed over 1,000 graduates in technical positions, with most making an average of more than $30/hour in their first job after graduation. Here is an overview of The Tech Academy's certification programs: 1. FLEXIBLE SCHEDULING & SELF-PACED TRAINING 2. BEGINNER-FRIENDLY COURSES 3. WELL-ROUNDED & THOROUGH TRAINING 4. AFFORDABLE & BUDGET-FRIENDLY TUITION 5. OVER 1,000 5-STAR REVIEWS ONLINE 6. JOB PLACEMENT TRAINING & ASSISTANCE The Tech Academy’s online certification programs start at $5,980, with multiple tuition financing options available. Start your journey into the technology industry today with one of our award-winning online coding boot camps! Find out more here by contacting us here: https://thetechacademy.us Your dream job in tech is just a Tech Academy boot camp away!
J36J+4X Honalo, HI, USA
$30/hour
Workable
Machine Learning Researcher / Engineer (Foundational Models)
About Pathway Pathway is building LiveAI™ systems that think and learn in real time as humans do. Our mission is to deeply understand how and why LLMs work, fundamentally changing the way models think. The team is made up of AI luminaries. Pathway's CTO, Jan Chorowski, co-authored papers with Geoff Hinton and Yoshua Bengio and was one of the first people to apply attention to speech. Our CSO, Adrian Kosowski, received his PhD in Theoretical Computer Science at the age of 20 and made significant contributions across numerous scientific fields, including AI and quantum information. He also served as a professor and a coach for competitive programmers at Ecole Polytechnique. The team also includes numerous world's top scientists and competitive programmers, alongside seasoned Silicon Valley executives. Pathway has strong investor backing. To date, we have raised over $15M; our latest reported round was our seed. Our offices are located in Palo Alto, CA, as well as Paris, France, and Wroclaw, Poland. The Opportunity This is an R&D position in attention-based models. We are currently searching for 1 or 2 R&D Engineers with a strong track record in machine learning models research. This is an extremely ambitious foundational project. There is a flexible GPU budget associated with this specific project, guaranteed to be in the 7-digit range minimum. You Will perform (distributed) model training. help improve/adapt model architectures based on experiment results. design new tasks and experiments. optionally: oversee activities of team members involved in data preparation. The results of your work will play a crucial role in the success of the project. Requirements Cover letter It's always a pleasure to say hi! If you could leave us 2-3 lines, we'd really appreciate that. You are expected to meet at least one of the following criteria: You have published at least one paper at NeurIPS, ICLR, or ICML - where you were the lead author or made significant conceptual & code contributions. You have significantly contributed to an LLM training effort which became newsworthy (topped a Huggingface benchmark, best in class model, etc.), preferably using multiple GPU's. You have spent at least 6 months working in a leading Machine Learning research center (e.g. at: Google Brain / Deepmind, Apple, Meta, Anthropic, Nvidia, MILA). You were an ICPC World Finalist, or an IOI, IMO, or IPhO medalist in High School. You Are A deep learning researcher, with a track record in Language Models and/or RL (candidates with a Vision or Robotics ML background are also welcome to apply). Interested in improving foundational architectures and creating new benchmarks. Experienced at hands-on experiments and model training (PyTorch, Jax, or Tensorflow). Have a good understanding of GPU architecture, memory design, and communication. Have a good understanding of graph algorithms. Have some familiarity with model monitoring, git, build systems, and CI/CD. Respectful of others Fluent in English Bonus Points Knowledge of approaches used in distributed training. Familiarity with Triton Successful track-record in algorithms & data science contests. Showing a code portfolio. Why You Should Apply Join an intellectually stimulating work environment. Be a pioneer: you get to work with a new type of "Live AI" challenges around long sequences and changing data. Be part of one of an early-stage AI startup that believes in impactful research and foundational changes. Benefits Type of contract: Full-time, permanent Preferable joining date: Immediate. The positions are open until filled – please apply immediately. Compensation: six-digit annual salary based on profile and location + Employee Stock Option Plan. Location: Remote work. Possibility to work or meet with other team members in one of our offices: Palo Alto, CA; Paris, France or Wroclaw, Poland. Candidates based anywhere in the EU, UK, United States, and Canada will be considered. If you meet our broad requirements but are missing some experience, don’t hesitate to reach out to us.
Palo Alto, CA, USA
$100,000/year
Craigslist
Autonomous Vehicle Operators (SAN FRANCISCO)
PLEASE FORWARD RESUME FOR CONSIDERATION Royalty Staffing is currently hiring Autonomous Vehicle Operator in San Francisco for our client who is a growing ride-share company. This is an excellent opportunity to be at the forefront of turning the company's vision into reality. We're looking for operators who are disciplined, team players, and believe in doing whatever it takes to accomplish the mission. Working as a VO will give you the opportunity to learn vehicle and personnel operations. You'll have a front-row seat to the operational complexities of realizing autonomous mobility and the chance to contribute to the future. We are currently hiring for multiple schedules, with bonus pay for weekend and night shifts with a start time of after 3pm (i.e. the weekend day shift offers an added 5% per hour and weekend night shift offers an added 10% per hour). Location – San Francisco, CA Work environment – Onsite/field Pay rate - 29.00 USD Per Hour Assignment duration – Ongoing contract SCHEDULE Hours: Day Shift: 5:45am-2:15pm, 6:45am-3:15pm, or 7:45am-4:15pm Night Shift: 1:45pm-10:15pm, 2:45pm-11:15pm, 3:45pm-12:15am, or 5:45pm-2am Days: Wednesday-Sunday Thursday-Monday Friday-Tuesday Saturday-Wednesday RESPONSIBILITIES Support vehicle operations. Drive 4-8 hours a day with a priority on safety. Conduct basic software operation tasks. Support missions through a wide variety of roles in and out of vehicles. Assist with documentation and metrics. Provide accurate written and oral feedback to engineering teams. Support vehicle maintenance and logistics. Conduct daily basic vehicle preventative maintenance checks, services, and repairs. Provide logistical support for the movement and storage of vehicles and equipment. Ensure the readiness and cleanliness of vehicles, equipment, and the workplace. Assist with paperwork and documentation related to vehicle readiness. REQUIREMENTS Basic vehicle knowledge to perform vehicle checks, ability to drive for long duration (6 hours in the car per day) Basic technology ability Excellent written and verbal communication skills Excellent driving history and no criminal history Proactive mindset and resourcefulness Bachelor's degree or equivalent technical experience is a plus BENEFITS Pre-tax commuter benefits Employer Subsidized healthcare benefits Flexible Spending Account for healthcare-related costs All costs for short- and long-term disability and life insurance 401k package
1422 Douglass St, San Francisco, CA 94131, USA
$29/hour
Workable
.NET Core Developer - DMV Systems
The ITD Department of Motor Vehicles (DMV) needs a strong .NET core/C# software engineer. This is a FULLY ONSITE POSITION located in Boise, Idaho. Remote work WILL NOT be considered. Experience: 6 Years IMPORTANT:  This is a FULLY ONSITE POSITION located in Boise, Idaho. Remote work WILL NOT be considered. Local candidates should be submitted for the position.  The ITD Department of Motor Vehicles (DMV) has the obligation to provide a variety of motor vehicle registration services, operator licensing services, and regulatory compliance services as mandated by the Idaho Legislature and applicable Federal regulations. DMV is modernizing their systems and is developing and maintaining a significant baseline of source code and associated data base structures as a part of this modernization effort. Session management in .Net Core web development will be a big part of the job. The scope of this work is to develop software systems for: 1. Driver's License Issuance and Credentialing 2. Vehicle Registration and Titling 3. Motor Vehicle regulation compliance 4. Commercial Motor Vehicle Services 5. Motor Vehicle Dealer licensing 6. Supporting Administrative Systems 7. Other features and functions as designated by the Motor Vehicle Administrator Agency Expected Deliverables 1. Reviewing, understanding and implementing defined customer requirements. 2. Reviewing, understanding and correcting identified problems. 3. Testing and verification of operation consistent with user requirements. 4. Source Code meeting ITD quality standards and in conformance with established procedures. 5. Release of object code meeting ITD quality standards and in conformance with established procedures. 6. Interacting with technical and non-technical staff as needed in the execution of the above in an Agile/Scrum environment
Boise, ID, USA
Negotiable Salary
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.