Browse
···
Log in / Register

Machine Learning Researcher / Engineer (Foundational Models)

$100,000/year

Pathway

Palo Alto, CA, USA

Favourites
Share

Description

About Pathway Pathway is building LiveAI™ systems that think and learn in real time as humans do. Our mission is to deeply understand how and why LLMs work, fundamentally changing the way models think. The team is made up of AI luminaries. Pathway's CTO, Jan Chorowski, co-authored papers with Geoff Hinton and Yoshua Bengio and was one of the first people to apply attention to speech. Our CSO, Adrian Kosowski, received his PhD in Theoretical Computer Science at the age of 20 and made significant contributions across numerous scientific fields, including AI and quantum information. He also served as a professor and a coach for competitive programmers at Ecole Polytechnique. The team also includes numerous world's top scientists and competitive programmers, alongside seasoned Silicon Valley executives. Pathway has strong investor backing. To date, we have raised over $15M; our latest reported round was our seed. Our offices are located in Palo Alto, CA, as well as Paris, France, and Wroclaw, Poland. The Opportunity This is an R&D position in attention-based models. We are currently searching for 1 or 2 R&D Engineers with a strong track record in machine learning models research. This is an extremely ambitious foundational project. There is a flexible GPU budget associated with this specific project, guaranteed to be in the 7-digit range minimum. You Will perform (distributed) model training. help improve/adapt model architectures based on experiment results. design new tasks and experiments. optionally: oversee activities of team members involved in data preparation. The results of your work will play a crucial role in the success of the project. Requirements Cover letter It's always a pleasure to say hi! If you could leave us 2-3 lines, we'd really appreciate that. You are expected to meet at least one of the following criteria: You have published at least one paper at NeurIPS, ICLR, or ICML - where you were the lead author or made significant conceptual & code contributions. You have significantly contributed to an LLM training effort which became newsworthy (topped a Huggingface benchmark, best in class model, etc.), preferably using multiple GPU's. You have spent at least 6 months working in a leading Machine Learning research center (e.g. at: Google Brain / Deepmind, Apple, Meta, Anthropic, Nvidia, MILA). You were an ICPC World Finalist, or an IOI, IMO, or IPhO medalist in High School. You Are A deep learning researcher, with a track record in Language Models and/or RL (candidates with a Vision or Robotics ML background are also welcome to apply). Interested in improving foundational architectures and creating new benchmarks. Experienced at hands-on experiments and model training (PyTorch, Jax, or Tensorflow). Have a good understanding of GPU architecture, memory design, and communication. Have a good understanding of graph algorithms. Have some familiarity with model monitoring, git, build systems, and CI/CD. Respectful of others Fluent in English Bonus Points Knowledge of approaches used in distributed training. Familiarity with Triton Successful track-record in algorithms & data science contests. Showing a code portfolio. Why You Should Apply Join an intellectually stimulating work environment. Be a pioneer: you get to work with a new type of "Live AI" challenges around long sequences and changing data. Be part of one of an early-stage AI startup that believes in impactful research and foundational changes. Benefits Type of contract: Full-time, permanent Preferable joining date: Immediate. The positions are open until filled – please apply immediately. Compensation: six-digit annual salary based on profile and location + Employee Stock Option Plan. Location: Remote work. Possibility to work or meet with other team members in one of our offices: Palo Alto, CA; Paris, France or Wroclaw, Poland. Candidates based anywhere in the EU, UK, United States, and Canada will be considered. If you meet our broad requirements but are missing some experience, don’t hesitate to reach out to us.

Source:  workable View original post

Location
Palo Alto, CA, USA
Show map

workable

You may also like

Workable
Full Stack Developer - Hybrid/Atlanta, GA
    Title – Full Stack Developer     Position – 12 + Months Contract     Location – Hybrid/Atlanta, GA     Rate- $Open(Best Possible) Design and develop user interfaces for web and mobile applications Collaborate with cross-functional teams to gather and evaluate user requirements Create and maintain reusable code libraries and UI components  2+ years’ experience in Java, Full-stack, C#, .NET and/or Python development experience is required  Develop amazingly efficient and effective software using Java, C#, Python, .Net, Spring Boot, Microservices, APIs Enhance and maintain on prem and internal PAAS applications, and actively rework these to AWS along project timelines Help to design and implement serverless patterns from containerized applications Build industry standard APIs and help with establishing, consuming & routing calls, connectivity protocols and policy Design, develop and implement architecture patterns that are optimized for SLAs, reliability, and cost Look upstream and downstream to see around corners and anticipate future consequences for immediate technical choices Help to establish and grow a culture of software craftsmanship best practices, including TDD/BDD and Test Automation (both Unit and Integration), Continuous Integration, and Continuous Deployment Note: If interested please send your updated resume and include your rate requirement along with your contact details with a suitable time when we can reach you. If you know of anyone in your sphere of contacts, who would be a perfect match for this job then, we would appreciate if you can forward this posting to them with a copy to us. We look forward to hearing from you at the earliest!
Atlanta, GA, USA
Negotiable Salary
Workable
Senior Fullstack Engineer
Staff4Me is seeking an experienced and driven Senior Fullstack Engineer to join our growing team. In this role, you will be responsible for overseeing the development of both client-side and server-side components of our web applications. As a Senior Engineer, you will play a vital role in shaping our technology stack and driving best practices within our team. Key Responsibilities: Fullstack Development: Design, build, and maintain robust web applications using a variety of modern technologies, including React, Angular, Node.js, and other relevant frameworks. Develop server-side applications and APIs that are efficient, clean, and scalable. Ensure high performance and responsiveness of applications by optimizing gateway to the server performance. Technical Leadership: Lead the architectural design and development of software features, ensuring alignment with business objectives. Mentor and guide junior and mid-level developers, cultivating a collaborative learning environment. Participate in code reviews and contribute to team knowledge sharing. Collaboration: Work with cross-functional teams including UX/UI designers, product managers, and other stakeholders to define and translate business requirements into technical specifications. Actively participate in Agile ceremonies, including sprint plannings and retrospectives, to improve team processes. Quality Assurance: Implement comprehensive testing strategies (unit, integration, end-to-end) to ensure software quality. Identify and address performance bottlenecks and other issues proactively. Innovation and Continuous Improvement: Stay informed about the latest trends and advancements in technology and software engineering. Propose innovative solutions and improvements to enhance existing systems and processes. Qualifications: Requirements Bachelor’s degree in Computer Science or a related field, or equivalent experience. 5+ years of experience in fullstack development with a strong portfolio of relevant work. Proficiency in front-end technologies such as JavaScript/Typescript, HTML, CSS, and frameworks like React, Angular, or Vue.js. Strong knowledge of back-end technologies including Node.js, Python, or similar languages. Experience with database technologies, both SQL and NoSQL. Solid understanding of RESTful APIs and microservices architecture. Experience with DevOps tools and practices, including CI/CD management. Excellent problem-solving skills and the ability to work independently and as part of a team. Strong communication skills, both written and verbal, with the ability to collaborate effectively.
New York, NY, USA
Negotiable Salary
Workable
Principal Software Engineer - C++
Job Title Principal Software Engineer - Program Analysis for AI Overview We are looking for an experienced software engineer to help us build a new generation of transpilation tools enabled by AI and modern verification techniques that promises to bridge the gap between algorithm development and deployment to embedded systems. In this role you will play a lead role in architecting and implementing novel code generation pipelines that use a mix of Generative AI, Static Analysis and Formal Verification methods to translate code written in one language to another. Requirements Responsibilities ● Define Software Architecture for Agentic AI pipelines. ● Build well tested extensible code foundations for code translation products. ● Collaborate with domain specialists to incorporate formal verification and static analysis methods into code generation pipeline. ● Collaborate with the software engineering and research teams to build robust code repositories and continuous integration processes. Must Have ● Seven or more years of experience with collaborative enterprise-level software development in C++ to deliver products to a large customer base ● Demonstrated experience gathering requirements from stakeholders and distilling them into software designs ● Demonstrated experience planning and executing on large projects in a team-based setting ● Demonstrated history of building and delivering robust software by employing best practices throughout the SDLC process, including Code review, Testing, Continuous integration, Release management and Build systems Great to Have ● Experience with Compiler development - experience with Clang, LLVM ● Experience with advanced software verification techniques like fuzzing and/or formal verification ● Python experience ● Experience with ML Tools and Frameworks ● Experience working with embedded, heterogeneous (FPGA and/or GPU), and/or distributed systems
Boston, MA, USA
Negotiable Salary
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.