Browse
···
Log in / Register

Machine Learning Researcher / Engineer (Foundational Models)

$100,000/year

Pathway

Palo Alto, CA, USA

Favourites
Share

Description

About Pathway Pathway is building LiveAI™ systems that think and learn in real time as humans do. Our mission is to deeply understand how and why LLMs work, fundamentally changing the way models think. The team is made up of AI luminaries. Pathway's CTO, Jan Chorowski, co-authored papers with Geoff Hinton and Yoshua Bengio and was one of the first people to apply attention to speech. Our CSO, Adrian Kosowski, received his PhD in Theoretical Computer Science at the age of 20 and made significant contributions across numerous scientific fields, including AI and quantum information. He also served as a professor and a coach for competitive programmers at Ecole Polytechnique. The team also includes numerous world's top scientists and competitive programmers, alongside seasoned Silicon Valley executives. Pathway has strong investor backing. To date, we have raised over $15M; our latest reported round was our seed. Our offices are located in Palo Alto, CA, as well as Paris, France, and Wroclaw, Poland. The Opportunity This is an R&D position in attention-based models. We are currently searching for 1 or 2 R&D Engineers with a strong track record in machine learning models research. This is an extremely ambitious foundational project. There is a flexible GPU budget associated with this specific project, guaranteed to be in the 7-digit range minimum. You Will perform (distributed) model training. help improve/adapt model architectures based on experiment results. design new tasks and experiments. optionally: oversee activities of team members involved in data preparation. The results of your work will play a crucial role in the success of the project. Requirements Cover letter It's always a pleasure to say hi! If you could leave us 2-3 lines, we'd really appreciate that. You are expected to meet at least one of the following criteria: You have published at least one paper at NeurIPS, ICLR, or ICML - where you were the lead author or made significant conceptual & code contributions. You have significantly contributed to an LLM training effort which became newsworthy (topped a Huggingface benchmark, best in class model, etc.), preferably using multiple GPU's. You have spent at least 6 months working in a leading Machine Learning research center (e.g. at: Google Brain / Deepmind, Apple, Meta, Anthropic, Nvidia, MILA). You were an ICPC World Finalist, or an IOI, IMO, or IPhO medalist in High School. You Are A deep learning researcher, with a track record in Language Models and/or RL (candidates with a Vision or Robotics ML background are also welcome to apply). Interested in improving foundational architectures and creating new benchmarks. Experienced at hands-on experiments and model training (PyTorch, Jax, or Tensorflow). Have a good understanding of GPU architecture, memory design, and communication. Have a good understanding of graph algorithms. Have some familiarity with model monitoring, git, build systems, and CI/CD. Respectful of others Fluent in English Bonus Points Knowledge of approaches used in distributed training. Familiarity with Triton Successful track-record in algorithms & data science contests. Showing a code portfolio. Why You Should Apply Join an intellectually stimulating work environment. Be a pioneer: you get to work with a new type of "Live AI" challenges around long sequences and changing data. Be part of one of an early-stage AI startup that believes in impactful research and foundational changes. Benefits Type of contract: Full-time, permanent Preferable joining date: Immediate. The positions are open until filled – please apply immediately. Compensation: six-digit annual salary based on profile and location + Employee Stock Option Plan. Location: Remote work. Possibility to work or meet with other team members in one of our offices: Palo Alto, CA; Paris, France or Wroclaw, Poland. Candidates based anywhere in the EU, UK, United States, and Canada will be considered. If you meet our broad requirements but are missing some experience, don’t hesitate to reach out to us.

Source:  workable View original post

Location
Palo Alto, CA, USA
Show map

workable

You may also like

Workable
Senior Fullstack Engineer
Staff4Me is seeking an experienced and driven Senior Fullstack Engineer to join our growing team. In this role, you will be responsible for overseeing the development of both client-side and server-side components of our web applications. As a Senior Engineer, you will play a vital role in shaping our technology stack and driving best practices within our team. Key Responsibilities: Fullstack Development: Design, build, and maintain robust web applications using a variety of modern technologies, including React, Angular, Node.js, and other relevant frameworks. Develop server-side applications and APIs that are efficient, clean, and scalable. Ensure high performance and responsiveness of applications by optimizing gateway to the server performance. Technical Leadership: Lead the architectural design and development of software features, ensuring alignment with business objectives. Mentor and guide junior and mid-level developers, cultivating a collaborative learning environment. Participate in code reviews and contribute to team knowledge sharing. Collaboration: Work with cross-functional teams including UX/UI designers, product managers, and other stakeholders to define and translate business requirements into technical specifications. Actively participate in Agile ceremonies, including sprint plannings and retrospectives, to improve team processes. Quality Assurance: Implement comprehensive testing strategies (unit, integration, end-to-end) to ensure software quality. Identify and address performance bottlenecks and other issues proactively. Innovation and Continuous Improvement: Stay informed about the latest trends and advancements in technology and software engineering. Propose innovative solutions and improvements to enhance existing systems and processes. Qualifications: Requirements Bachelor’s degree in Computer Science or a related field, or equivalent experience. 5+ years of experience in fullstack development with a strong portfolio of relevant work. Proficiency in front-end technologies such as JavaScript/Typescript, HTML, CSS, and frameworks like React, Angular, or Vue.js. Strong knowledge of back-end technologies including Node.js, Python, or similar languages. Experience with database technologies, both SQL and NoSQL. Solid understanding of RESTful APIs and microservices architecture. Experience with DevOps tools and practices, including CI/CD management. Excellent problem-solving skills and the ability to work independently and as part of a team. Strong communication skills, both written and verbal, with the ability to collaborate effectively.
New York, NY, USA
Negotiable Salary
Workable
Principal Software Engineer - C++
Job Title Principal Software Engineer - Program Analysis for AI Overview We are looking for an experienced software engineer to help us build a new generation of transpilation tools enabled by AI and modern verification techniques that promises to bridge the gap between algorithm development and deployment to embedded systems. In this role you will play a lead role in architecting and implementing novel code generation pipelines that use a mix of Generative AI, Static Analysis and Formal Verification methods to translate code written in one language to another. Requirements Responsibilities ● Define Software Architecture for Agentic AI pipelines. ● Build well tested extensible code foundations for code translation products. ● Collaborate with domain specialists to incorporate formal verification and static analysis methods into code generation pipeline. ● Collaborate with the software engineering and research teams to build robust code repositories and continuous integration processes. Must Have ● Seven or more years of experience with collaborative enterprise-level software development in C++ to deliver products to a large customer base ● Demonstrated experience gathering requirements from stakeholders and distilling them into software designs ● Demonstrated experience planning and executing on large projects in a team-based setting ● Demonstrated history of building and delivering robust software by employing best practices throughout the SDLC process, including Code review, Testing, Continuous integration, Release management and Build systems Great to Have ● Experience with Compiler development - experience with Clang, LLVM ● Experience with advanced software verification techniques like fuzzing and/or formal verification ● Python experience ● Experience with ML Tools and Frameworks ● Experience working with embedded, heterogeneous (FPGA and/or GPU), and/or distributed systems
Boston, MA, USA
Negotiable Salary
Craigslist
Software coder/marketer wanting % of sales for award winning software. (Henderson)
I am looking for an entrepreneur who understands financial software coding and is looking to be able to invest in this award winning software program with no financial investment. It has already been programed. You will need the ability to unpack (decompress) the new build and create a website platform to launch it (it is an Internet based program running on all platforms). You will be required to sell it yourself or build a sales team to sell it. (I have previously sold over $500,000 my first year to individual agents myself thru my speaking company called Computer Camp). This program (called Financial Keys) won Best Product of the Year for the National Association of REALTORS® and has been one of the top selling software programs in the country to individual agents for $350 per program. The new marketing platform will be to sell it to large franchises, Boards and Associations on a subscription basis. On a subscription sale at a monthly price of only 15¢ per agent per month for a small Association of only 18K agents is an annual income of almost $33,000 (to the owners of FinKeys). A sale to a large Association like the Florida Association of REALTORS is an annual revenue stream of almost $500,000! And that’s just one sale! This is like investing in a McDonald’s franchise yet nationwide. To get a preview of what this software does, you can copy this link on your smartphone or your computer: (Full link address is: https://www.youtube.com/watch?v=0VsniiURJdI )
WR3C+2C Henderson, NV, USA
Negotiable Salary
Workable
Senior Software Developer (Gateway/Market Data)
Eagle Seven is seeking a Senior Software Developer focused on exchange connectivity and market data.  The individual will be responsible for analyzing exchange protocols, proposing design solutions, and implementing connectivity to trading venues across the world. The role be a part of the platform development team and will provide the individual with exposure to traders and strategy developers. The successful candidate will be a self-starter, have strong sense of ownership and be driven to provide technical and intellectual solutions to business problems.    Primary Responsibilities include: Architecting and implementing low-latency market access solutions Understanding, interpreting, and interfacing with global exchanges and their protocols Designing, developing, and supporting market data feed handlers and exchange order routers Diagnosing latency issues and resolving with appropriate tuning and optimizations Working with traders to source, evaluate and facilitate access to new data sources Working with extended team to capture, house, and provide historical access to market data Liaise with vendors on data and technical issues as needed to deliver rapid solutions to the business Requirements Skills and Experience: Bachelor’s degree in Computer Science or related field Proven track record of understanding and working with global exchange protocols Experience with writing parsers for exchange protocols such as FIX and ITCH, etc. Strong background in C++ and C++ Template metaprogramming with demonstrated experience using C++14/C++20 Expertise with TCP/IP, UDP multicast, sockets, network protocols, particularly on Linux/Unix systems Experience using network tools such as Wireshark and TCPDump to monitor and debug behavior Ability to work in a collaborative environment Excellent written and verbal communication skills Benefits Eagle Seven offers a competitive and comprehensive benefits package to all full-time employees. Medical PPO and HMO coverage through BlueCross BlueShield Company Contributions to a Health Savings Account (with enrollment into a High Deductible Health Plan) Dental coverage through Principal Vision coverage through VSP 401k Retirement Savings Plan with Employer Match Company Paid Life Insurance Company Paid Disability Insurance Paid Time Off Flexible Spending Account Pre-tax Transit Benefits Complimentary Lunch and Beverages The minimum base salary for this role starts at $150,000. This role is eligible for a discretionary performance bonus as part of the total compensation package, in addition to the benefits listed above. Exact compensation offered may vary based on factors including, but not limited to, the candidate's experience, qualifications, and skill set.
Chicago, IL, USA
$150,000/year
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.