Machine Learning Researcher / Engineer (Foundational Models)

$100,000/year

Pathway

Palo Alto, CA, USA

Description

About Pathway

Pathway is building LiveAI™ systems that think and learn in real time as humans do. Our mission is to deeply understand how and why LLMs work, fundamentally changing the way models think.

The team is made up of AI luminaries. Pathway's CTO, Jan Chorowski, co-authored papers with Geoff Hinton and Yoshua Bengio and was one of the first people to apply attention to speech. Our CSO, Adrian Kosowski, received his PhD in Theoretical Computer Science at the age of 20 and has made significant contributions across numerous scientific fields, including AI and quantum information. He also served as a professor and a coach for competitive programmers at Ecole Polytechnique. The team also includes many of the world's top scientists and competitive programmers, alongside seasoned Silicon Valley executives.

Pathway has strong investor backing. To date, we have raised over $15M; our latest reported round was our seed. Our offices are located in Palo Alto, CA, as well as Paris, France, and Wroclaw, Poland.

The Opportunity

This is an R&D position in attention-based models. We are currently searching for 1 or 2 R&D Engineers with a strong track record in machine learning model research. This is an extremely ambitious foundational project. There is a flexible GPU budget associated with this specific project, guaranteed to be in the 7-digit range minimum.

You Will

- Perform (distributed) model training.
- Help improve/adapt model architectures based on experiment results.
- Design new tasks and experiments.
- Optionally: oversee activities of team members involved in data preparation.

The results of your work will play a crucial role in the success of the project.

Requirements

Cover letter

It's always a pleasure to say hi! If you could leave us 2-3 lines, we'd really appreciate that.

You are expected to meet at least one of the following criteria:

- You have published at least one paper at NeurIPS, ICLR, or ICML, where you were the lead author or made significant conceptual & code contributions.
- You have significantly contributed to an LLM training effort which became newsworthy (topped a Hugging Face benchmark, best-in-class model, etc.), preferably using multiple GPUs.
- You have spent at least 6 months working in a leading Machine Learning research center (e.g. at: Google Brain / DeepMind, Apple, Meta, Anthropic, Nvidia, MILA).
- You were an ICPC World Finalist, or an IOI, IMO, or IPhO medalist in high school.

You Are

- A deep learning researcher, with a track record in Language Models and/or RL (candidates with a Vision or Robotics ML background are also welcome to apply).
- Interested in improving foundational architectures and creating new benchmarks.
- Experienced at hands-on experiments and model training (PyTorch, JAX, or TensorFlow).
- Someone with a good understanding of GPU architecture, memory design, and communication.
- Someone with a good understanding of graph algorithms.
- Somewhat familiar with model monitoring, git, build systems, and CI/CD.
- Respectful of others.
- Fluent in English.

Bonus Points

- Knowledge of approaches used in distributed training.
- Familiarity with Triton.
- Successful track record in algorithms & data science contests.
- A code portfolio to show.

Why You Should Apply

- Join an intellectually stimulating work environment.
- Be a pioneer: you get to work on a new type of "Live AI" challenge around long sequences and changing data.
- Be part of an early-stage AI startup that believes in impactful research and foundational changes.

Benefits

- Type of contract: Full-time, permanent.
- Preferable joining date: Immediate. The positions are open until filled – please apply immediately.
- Compensation: six-digit annual salary based on profile and location + Employee Stock Option Plan.
- Location: Remote work. Possibility to work or meet with other team members in one of our offices: Palo Alto, CA; Paris, France; or Wroclaw, Poland. Candidates based anywhere in the EU, UK, United States, and Canada will be considered.

If you meet our broad requirements but are missing some experience, don't hesitate to reach out to us.

Source: Workable

© 2025 Servanan International Pte. Ltd.