Browse
···
Log in / Register

Machine Learning Researcher / Engineer (Foundational Models)

$100,000/year

Pathway

Palo Alto, CA, USA

Favourites
Share

Description

About Pathway Pathway is building LiveAI™ systems that think and learn in real time as humans do. Our mission is to deeply understand how and why LLMs work, fundamentally changing the way models think. The team is made up of AI luminaries. Pathway's CTO, Jan Chorowski, co-authored papers with Geoff Hinton and Yoshua Bengio and was one of the first people to apply attention to speech. Our CSO, Adrian Kosowski, received his PhD in Theoretical Computer Science at the age of 20 and made significant contributions across numerous scientific fields, including AI and quantum information. He also served as a professor and a coach for competitive programmers at Ecole Polytechnique. The team also includes numerous world's top scientists and competitive programmers, alongside seasoned Silicon Valley executives. Pathway has strong investor backing. To date, we have raised over $15M; our latest reported round was our seed. Our offices are located in Palo Alto, CA, as well as Paris, France, and Wroclaw, Poland. The Opportunity This is an R&D position in attention-based models. We are currently searching for 1 or 2 R&D Engineers with a strong track record in machine learning models research. This is an extremely ambitious foundational project. There is a flexible GPU budget associated with this specific project, guaranteed to be in the 7-digit range minimum. You Will perform (distributed) model training. help improve/adapt model architectures based on experiment results. design new tasks and experiments. optionally: oversee activities of team members involved in data preparation. The results of your work will play a crucial role in the success of the project. Requirements Cover letter It's always a pleasure to say hi! If you could leave us 2-3 lines, we'd really appreciate that. You are expected to meet at least one of the following criteria: You have published at least one paper at NeurIPS, ICLR, or ICML - where you were the lead author or made significant conceptual & code contributions. You have significantly contributed to an LLM training effort which became newsworthy (topped a Huggingface benchmark, best in class model, etc.), preferably using multiple GPU's. You have spent at least 6 months working in a leading Machine Learning research center (e.g. at: Google Brain / Deepmind, Apple, Meta, Anthropic, Nvidia, MILA). You were an ICPC World Finalist, or an IOI, IMO, or IPhO medalist in High School. You Are A deep learning researcher, with a track record in Language Models and/or RL (candidates with a Vision or Robotics ML background are also welcome to apply). Interested in improving foundational architectures and creating new benchmarks. Experienced at hands-on experiments and model training (PyTorch, Jax, or Tensorflow). Have a good understanding of GPU architecture, memory design, and communication. Have a good understanding of graph algorithms. Have some familiarity with model monitoring, git, build systems, and CI/CD. Respectful of others Fluent in English Bonus Points Knowledge of approaches used in distributed training. Familiarity with Triton Successful track-record in algorithms & data science contests. Showing a code portfolio. Why You Should Apply Join an intellectually stimulating work environment. Be a pioneer: you get to work with a new type of "Live AI" challenges around long sequences and changing data. Be part of one of an early-stage AI startup that believes in impactful research and foundational changes. Benefits Type of contract: Full-time, permanent Preferable joining date: Immediate. The positions are open until filled – please apply immediately. Compensation: six-digit annual salary based on profile and location + Employee Stock Option Plan. Location: Remote work. Possibility to work or meet with other team members in one of our offices: Palo Alto, CA; Paris, France or Wroclaw, Poland. Candidates based anywhere in the EU, UK, United States, and Canada will be considered. If you meet our broad requirements but are missing some experience, don’t hesitate to reach out to us.

Source:  workable View original post

Location
Palo Alto, CA, USA
Show map

workable

You may also like

Workable
C++ Market Data Engineer
Trexquant is a growing systematic fund at the forefront of quantitative finance, with a core team of highly accomplished researchers and engineers. To keep pace with our expanding global trading operations, we are seeking a C++ Market Data Engineer to design and build ultra-low-latency feed handlers for premier vendor feeds and major exchange multicast feeds. This is a high-impact role that sits at the heart of Trexquant's trading platform; the quality, speed, and reliability of your code directly influence every strategy we run. Responsibilities Design & implement high-performance feed handlers in modern C++ for equities, futures, and options across global venues (e.g., NYSE, CME, Refinitiv RTS, Bloomberg B-PIPE). Optimize for micro- and nanosecond latency using lock-free data structures, cache-friendly memory layouts, and kernel-bypass networking where appropriate. Build reusable libraries for message decoding, normalization, and publication to internal buses shared by research, simulation, and live trading systems. Collaborate with cross-functional teams to tune TCP/UDP multicast stacks, kernel parameters, and NIC settings for deterministic performance. Provide robust failover, gap-recovery, and replay mechanisms to guarantee data integrity under packet loss or venue outages. Instrument code paths with precision timestamping and performance metrics; drive continuous latency regression testing and capacity planning. Partner closely with quantitative researchers to understand downstream data requirements and to fine-tune delivery formats for both simulation and live trading. Produce clear architecture documents, operational run-books, and post-mortems; participate in a 24×7 follow-the-sun support rotation for mission-critical market-data services. Requirements BS/MS/PhD in Computer Science, Electrical Engineering, or related field. 3+ years of professional C++ (14,17,20) development experience focused on low-latency, high-throughput systems. Proven track record building or maintaining real-time market-data feeds (e.g., Refinitiv RTS/TREP, Bloomberg B-PIPE, OPRA, CME MDP, ITCH). Strong grasp of concurrency, lock-free algorithms, memory-model semantics, and compiler optimizations. Familiarity with serialization formats (FAST, SBE, Protocol Buffers) and time-series databases or in-memory caches. Comfort with scripting in Python for prototyping, testing, and ops automation. Excellent problem-solving skills, ownership mindset, and ability to thrive in a fast-paced trading environment. Familiarity with containerization (Docker/K8s) and public-cloud networking (AWS, GCP). Benefits Competitive salary, plus bonus based on individual and company performance. Collaborative, casual, and friendly work environment while solving the hardest problems in the financial markets. PPO Health, dental and vision insurance premiums fully covered for you and your dependents. Pre-Tax Commuter Benefits  Trexquant is an Equal Opportunity Employer
Stamford, CT, USA
Negotiable Salary
Craigslist
Software Engineer - Full Stack Developer
Unison Consulting, Inc. (Unison) is a leading aviation consulting firm with more than 36 years of experience advising clients around the world. Unison has provided aviation consulting services to over 150 airports throughout the United States, Canada, South America and the Caribbean, including 17 of the 25 busiest U.S. airports. Unison is a recognized industry leader and we seek self-motivated, driven individuals to help us bring innovative ideas to our clients. Interested individuals should have talents and experience commensurate with the position being pursued. Unison is seeking a software engineer who can wear every hat in the full development life cycle of Net applications. Our ideal candidate is a highly motivated individual who is looking to learn about the latest programming languages to create and extend applications that are flexible and scalable across a wide range of strategies. Primary duties will include UI development, database development, integration with 3rd party applications and middleware development. Required Experience • Minimum 3 years’ experience with the following technologies: • HTML, JavaScript, jQuery, AJAX, Bootstrap • Visual Studio • Writing SQL queries and stored procedures of moderate complexity on SQL Server or Oracle PLSQL • Knowledge of SOAP and RESTful web services • Self-motivated; experience working well both independently and collaboratively Desired but not required • Experience with PowerBI • Experience with Oracle Report Builder Job Type This is a permanent fulltime onsite position (hybrid work environment). No contract/ C2C or sponsorship is available for this position. Must be authorized to work in the US without expiration. Required Education • Bachelor degree preferred • Minimum High School with a minimum of 5 years work experience Other Requirements • Must pass an aptitude test Benefits • Competitive base salary and overall compensation package • Full benefits: Medical, Dental, Vision • Paid vacation, sick, and holidays • Life Insurance coverage • 401K with company match • Tuition reimbursement Schedule: • 8 hour shift Pay $83,000.00 $87,000.00 per year
Lakeshore East Dog Park, 450 E Benton Pl, Chicago, IL 60601, USA
$83,000-87,000/year
Workable
Machine Learning Engineer, ML Runtime & Optimization
Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services at a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai’s leading position has been recognized, with CNBC ranking Pony.ai #10 on its CNBC Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai has accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public at NASDAQ in Nov. 2024. Responsibility The ML Infrastructure team at Pony.ai provides a set of tools to support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to accelerate the training and inferences of the AI models in autonomous driving systems. This includes: Identifying key applications for current and future autonomous driving problems and performing in-depth analysis and optimization to ensure the best possible performance on current and next-generation compute architectures. Collaborating closely with diverse groups in Pony.ai including both hardware and software to optimize and craft core parallel algorithms as well as to influence the next-generation compute platform architecture design and software infrastructure. Apply model optimization and efficient deep learning techniques to models and optimized ML operator libraries. Work across the entire ML framework/compiler stack (e.g.Torch, CUDA and TensorRT), and system-efficient deep learning models. Requirements BS/MS or Ph.D in computer science, electrical engineering or a related discipline. Strong programming skills in C/C++ or Python. Experience on model optimization, quantization or other efficient deep learning techniques Good understanding of hardware performance, regarding CPU or GPU execution model, threads, registers, cache, cost/performance trade-off, etc. Experience with profiling, benchmarking and validating performance for complex computing architectures. Experience in optimizing the utilization of compute resources, identifying and resolving compute and data flow bottlenecks. Strong communication skills and ability to work cross-functionally between software and hardware teams Preferred Qualifications: One or more of the following fields are preferred Experience with parallel programming, ideally CUDA, OpenCL or OpenACC. Experience in computer vision, machine learning and deep learning. Strong knowledge of software design, programming techniques and algorithms. Good knowledge of common deep learning frameworks and libraries. Deep knowledge on system performance, GPU optimization or ML compiler. Compensation and Benefits Base Salary Range: $140,000 - $250,000 Annually Compensation may vary outside of this range depending on many factors, including the candidate’s qualifications, skills, competencies, experience, and location. Base pay is one part of the Total Compensation and this role may be eligible for bonuses/incentives and restricted stock units. Also, we provide the following benefits to the eligible employees: Health Care Plan (Medical, Dental & Vision) Retirement Plan (Traditional and Roth 401k) Life Insurance (Basic, Voluntary & AD&D) Paid Time Off (Vacation & Public Holidays) Family Leave (Maternity, Paternity) Short Term & Long Term Disability Free Food & Snacks
Fremont, CA, USA
$140,000-250,000/year
Craigslist
Autonomous Vehicle Operators (SAN FRANCISCO)
PLEASE FORWARD RESUME FOR CONSIDERATION Royalty Staffing is currently hiring Autonomous Vehicle Operator in San Francisco for our client who is a growing ride-share company. This is an excellent opportunity to be at the forefront of turning the company's vision into reality. We're looking for operators who are disciplined, team players, and believe in doing whatever it takes to accomplish the mission. Working as a VO will give you the opportunity to learn vehicle and personnel operations. You'll have a front-row seat to the operational complexities of realizing autonomous mobility and the chance to contribute to the future. We are currently hiring for multiple schedules, with bonus pay for weekend and night shifts with a start time of after 3pm (i.e. the weekend day shift offers an added 5% per hour and weekend night shift offers an added 10% per hour). Location – San Francisco, CA Work environment – Onsite/field Pay rate - 29.00 USD Per Hour Assignment duration – Ongoing contract SCHEDULE Hours: Day Shift: 5:45am-2:15pm, 6:45am-3:15pm, or 7:45am-4:15pm Night Shift: 1:45pm-10:15pm, 2:45pm-11:15pm, 3:45pm-12:15am, or 5:45pm-2am Days: Wednesday-Sunday Thursday-Monday Friday-Tuesday Saturday-Wednesday RESPONSIBILITIES Support vehicle operations. Drive 4-8 hours a day with a priority on safety. Conduct basic software operation tasks. Support missions through a wide variety of roles in and out of vehicles. Assist with documentation and metrics. Provide accurate written and oral feedback to engineering teams. Support vehicle maintenance and logistics. Conduct daily basic vehicle preventative maintenance checks, services, and repairs. Provide logistical support for the movement and storage of vehicles and equipment. Ensure the readiness and cleanliness of vehicles, equipment, and the workplace. Assist with paperwork and documentation related to vehicle readiness. REQUIREMENTS Basic vehicle knowledge to perform vehicle checks, ability to drive for long duration (6 hours in the car per day) Basic technology ability Excellent written and verbal communication skills Excellent driving history and no criminal history Proactive mindset and resourcefulness Bachelor's degree or equivalent technical experience is a plus BENEFITS Pre-tax commuter benefits Employer Subsidized healthcare benefits Flexible Spending Account for healthcare-related costs All costs for short- and long-term disability and life insurance 401k package
1422 Douglass St, San Francisco, CA 94131, USA
$29/hour
Workable
Low-Latency Developer
Atto Trading, a dynamic quantitative trading firm founded in 2010 and leading in global high-frequency strategies, is looking for a Low-Latency Developer to join our team. We are expanding an international, diverse team, with experts in trading, statistics, engineering, and technology. Our disciplined approach combined with rapid market feedback allows us to quickly turn ideas into profit. Our environment of learning and collaboration allows us to solve some of the world’s hardest problems, together. As a small firm, we remain nimble and hold ourselves to the highest standards of integrity, ingenuity, and effort.  Position Highlights: We are modernizing our trading and research platform to scale our alpha trading business. This platform will enable researchers to explore, test, and deploy sophisticated signals, models, and strategies across asset classes in a robust, fully automated manner while maintaining competitive latency targets. As a Low-Latency Developer, you will be responsible for designing, optimizing, and maintaining high-performance trading systems to minimize latency. Your Mission and Goals: Analyze and optimize the performance of low-latency trading systems by identifying bottlenecks and inefficiencies in the code, and implementing effective solutions.  Develop and adapt the platform to support the demands of a fast-paced trading environment, while effectively managing technical debt. Requirements Over 5 years of experience as a low-latency developer with a focus on performance optimization in a high-frequency trading (HFT) environment. Experience with multiple components of an HFT platform or system, particularly those on the critical path. Experience working at an HFT firm during its startup phase and/or on a trading team is a significant plus. Technical Skills: Deep knowledge of HFT platforms: networking, kernel bypass, market data, order entry, threading, inter-process communication, and strategy APIs. Proven low-latency development and performance optimization in HFT. Strong proficiency in C++. Excellent understanding of CPU caches and cache efficiency. Experience with multithreaded and multi-process synchronization. Good understanding of networking protocols. Skilled in performance profiling and optimization tools. Advanced knowledge of Linux operating systems, including kernel-level device mechanisms. About You: Practical decision-making skills. Excellent communication skills. Strong analytical and problem-solving skills. Passion for trading. Ability to work independently and as part of a team. Benefits Competitive rates of pay. Paid time off (5 weeks). Coverage of health insurance costs. Office lunches. Discretionary bonus system.  Annual base salary range of $150,000 to $300,000. Pay (base and bonus) may vary depending on job-related skills and experience. Our motivation: We are a company committed to staying at the forefront of technology. Our team is passionate about continual learning and improvement. With no external investors or customers, we are the primary users of the products we create, giving you the opportunity to make a real impact on our company's growth. Ready to advance your career? Join our innovative team and help shape the future of trading on a global scale. Apply now and let's create the future together!
New York, NY, USA
$150,000-300,000/year
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.