Browse
···
Log in / Register

Machine Learning Researcher / Engineer (Foundational Models)

$100,000/year

Pathway

Palo Alto, CA, USA

Favourites
Share

Description

About Pathway Pathway is building LiveAI™ systems that think and learn in real time as humans do. Our mission is to deeply understand how and why LLMs work, fundamentally changing the way models think. The team is made up of AI luminaries. Pathway's CTO, Jan Chorowski, co-authored papers with Geoff Hinton and Yoshua Bengio and was one of the first people to apply attention to speech. Our CSO, Adrian Kosowski, received his PhD in Theoretical Computer Science at the age of 20 and made significant contributions across numerous scientific fields, including AI and quantum information. He also served as a professor and a coach for competitive programmers at Ecole Polytechnique. The team also includes numerous world's top scientists and competitive programmers, alongside seasoned Silicon Valley executives. Pathway has strong investor backing. To date, we have raised over $15M; our latest reported round was our seed. Our offices are located in Palo Alto, CA, as well as Paris, France, and Wroclaw, Poland. The Opportunity This is an R&D position in attention-based models. We are currently searching for 1 or 2 R&D Engineers with a strong track record in machine learning models research. This is an extremely ambitious foundational project. There is a flexible GPU budget associated with this specific project, guaranteed to be in the 7-digit range minimum. You Will perform (distributed) model training. help improve/adapt model architectures based on experiment results. design new tasks and experiments. optionally: oversee activities of team members involved in data preparation. The results of your work will play a crucial role in the success of the project. Requirements Cover letter It's always a pleasure to say hi! If you could leave us 2-3 lines, we'd really appreciate that. You are expected to meet at least one of the following criteria: You have published at least one paper at NeurIPS, ICLR, or ICML - where you were the lead author or made significant conceptual & code contributions. You have significantly contributed to an LLM training effort which became newsworthy (topped a Huggingface benchmark, best in class model, etc.), preferably using multiple GPU's. You have spent at least 6 months working in a leading Machine Learning research center (e.g. at: Google Brain / Deepmind, Apple, Meta, Anthropic, Nvidia, MILA). You were an ICPC World Finalist, or an IOI, IMO, or IPhO medalist in High School. You Are A deep learning researcher, with a track record in Language Models and/or RL (candidates with a Vision or Robotics ML background are also welcome to apply). Interested in improving foundational architectures and creating new benchmarks. Experienced at hands-on experiments and model training (PyTorch, Jax, or Tensorflow). Have a good understanding of GPU architecture, memory design, and communication. Have a good understanding of graph algorithms. Have some familiarity with model monitoring, git, build systems, and CI/CD. Respectful of others Fluent in English Bonus Points Knowledge of approaches used in distributed training. Familiarity with Triton Successful track-record in algorithms & data science contests. Showing a code portfolio. Why You Should Apply Join an intellectually stimulating work environment. Be a pioneer: you get to work with a new type of "Live AI" challenges around long sequences and changing data. Be part of one of an early-stage AI startup that believes in impactful research and foundational changes. Benefits Type of contract: Full-time, permanent Preferable joining date: Immediate. The positions are open until filled – please apply immediately. Compensation: six-digit annual salary based on profile and location + Employee Stock Option Plan. Location: Remote work. Possibility to work or meet with other team members in one of our offices: Palo Alto, CA; Paris, France or Wroclaw, Poland. Candidates based anywhere in the EU, UK, United States, and Canada will be considered. If you meet our broad requirements but are missing some experience, don’t hesitate to reach out to us.

Source:  workable View original post

Location
Palo Alto, CA, USA
Show map

workable

You may also like

Workable
Oracle IDM /ADF - Washington, DC (Long-Term)
Title: Oracle IDM (Oracle Identity Management) Location: Washington, DC Position: Contract Duration: 1+ Year’s Rate: $Market/Hr Job Overview & Responsibilities:- The candidate will provide technical consulting/advisory, implementation, as well as general maintenance and operational support services to our clients in the following areas: • Experience with Oracle IDM (Oracle Identity Management) OR Oracle ADF. • Experience with Sailpoint Identity IQ, including design, development and implementation skills. • Experience in implementation of enterprise-level, and distributed server-side applications Integration experience with database server technologies such as Microsoft SQL, Oracle, etc. • Integration experience with web server technologies such as Apache, IIS, etc. • Integration experience with directory server technologies such as Active Directory, openLdap, etc. • Ability to meet project schedule deliverables • Demonstrated ability to write documentation deliverables including recommendations, assessments, root cause analyses, project roadmaps, and other reports • Must possess excellent Microsoft Word, Excel, Visio, and PowerPoint skills • Security product certifications desired; CISSP • General Web development and programming skills with either Java, .NET, PHP, HTML, XML, CSS, Perl, shell, or SQL are highly desirable • Experience with smart card technology, Smart Card Management Systems, and PKI a plus • Must be familiar with Federal policies and standards on information security and authentication (FIPS, NIST, PIV, etc.) • Solid understanding of Federal credentialing standards (PIV, PIV-I/C, TWIC, FRAC, etc.) • Experience in assessment, evaluation, and design of solutions related to encryption and key management, identity management, strong authentication, and end-point security • Experience in implementation of federation/SAML technologies such as ADFS, etc. Requirements Note: If interested please send your updated resume to gowri.sankar@two95intl.com and include your Salary requirement along with your contact details with a suitable time when we can reach you. If you know of anyone in your sphere of contacts, who would be a perfect match for this job then, we would appreciate if you can forward this posting to them with a copy to us. We look forward to hearing from you at the earliest!
Washington, DC, USA
Negotiable Salary
Workable
Data Engineer (USA)
Trexquant is a growing systematic fund manager with a core team of highly accomplished technologists. We apply a wide variety of statistical and machine learning techniques to build investment portfolios and trade our client assets in global equity and futures markets. With locations in the US, China and India, our global team in excess of 50 employees is comprised primarily of research professionals with advanced science, math and technology degrees who explore the universe of quantitative methods for opportunities to enhance and adapt our platform and profit in an exciting and dynamic environment. We are seeking a dedicated and detail-oriented Data Engineer to join our team. The primary goal of this team is to assist researchers in writing, downloading and reading scripts, effectively translating their ideas into data variables for trading purposes. The ideal candidate will have a strong background in data engineering, excellent scripting skills, and a keen understanding of financial data and trading strategies. Responsibilities Collaborate with researchers to understand their data requirements and trading strategies. Develop and maintain scripts for downloading, reading, and processing data from various sources. Ensure data accuracy, consistency, and reliability by implementing quality control measures. Optimize data retrieval processes for efficiency and performance. Assist in the integration of new data sources and formats into the existing data infrastructure. Provide technical support and troubleshooting for data-related issues. Document data workflows, processes, and scripts for future reference and knowledge sharing. Stay updated with industry trends and advancements in data engineering and financial data. Requirements A degree in a technical discipline (computer science, mathematics, statistics, physics, etc.)  1+ years of experience in Python as used in data capacity Experience in financial services and working with financial data providers is a plus Ability to work independently and take projects to completion, quickly learn new systems, think creatively and pay attention to details Benefits Competitive salary plus bonus bonus based on individual and company performance Collaborative, Casual, and friendly work environment PPO Health, dental and vision insurance premiums fully covered for you and your dependents Pre-tax commuter benefits Weekly company meals Trexquant is an Equal Opportunity Employer
Stamford, CT, USA
Negotiable Salary
Craigslist
Business Analyst - Full Time / Full Benefits (Factoria)
Business Analyst w/ a focus on: cyber detective job focused on SMS security Overview: The primary day to day role of this backfill will be reviewing commercial SMS traffic using various databases and web portals as well as running Splunk queries in order to identify phishing, Social Engineering, or other illegal/disallowed content. Key Responsibilities: Review commercial SMS traffic using internal databases and web portals. Execute and analyze Splunk queries to detect suspicious or prohibited content. Identify phishing, social engineering, and other malicious messaging patterns. Adhere to internal guides and industry standards Maintain detailed documentation and tracking reports for investigations. Handle confidential content with discretion and professionalism. Required Skills & Qualifications: Strong aptitude for reviewing large datasets with high attention to detail. Solid understanding of phishing and social engineering tactics. Experience with database querying tools (preferably Splunk). Thorough and cautious approach to investigations, with an understanding of the potential impact of errors in a production environment. Proven ability to maintain accurate and comprehensive documentation. Familiarity with Shortcode, 10DLC, and Toll-Free number systems. Preferred Qualifications: Prior experience working on confidential or sensitive projects. Exposure to dark projects or environments involving secure content review.
7540 Leary Wy, Redmond, WA 98052, USA
$40/hour
Workable
Senior/Lead Backend (NodeJS) Engineer - Onsite
About Deep Origin Led by Michael Antonov, a co-founder of Oculus, and well-funded by Formic Ventures, Deep Origin is poised to reinvent the way scientists work and life science innovations come to life. We see a future largely free of diseases, with a 150-year lifespan being a norm. To get there, we are building an operating system for science, enabling scientists to be more productive and to bring tomorrow's ideas to life quickly and at a reasonable cost. Applicants must be authorized to work for any employer in the U.S. We are unable to sponsor or take over sponsorship of an employment Visa at this time. Role Description In this hands-on position, you will be a key member of the software engineering team, building our key functionality and integrating with key partners. Your responsibilities will range from designing and developing complex, large-scale systems to writing APIs that integrate with various cloud providers and partners. You will have ownership in key software feature areas and their architectural design, as well as software implementation with a high level of independence and impact. Requirements 7+ years of experience designing, building, and operating complex, highly-scalable, distributed applications and systems 3+ years of hands-on software development experience with TypeScript/JavaScript/NodeJS Experience with both relational databases (e.g. Postgres) and NOSQL (e.g. MongoDB) Knowledge of Kubernetes and Cloud infrastructure/deployment tools (specifically with cluster operations and operators) Has built platforms from an early stage Has scaled platforms to handle many users (10,000+ DAU) Has extensive system-design experience Has experience designing systems with complex data-sets/relations Has experience working with distributed systems/platforms Thinks about architecture first and how the code fits in second Has experience working with/implementing a multi-tenant system Systematic problem-solving approach, coupled with a strong sense of ownership and drive Ability to work both independently and on the team Experience working in high-energy startups with fast product delivery mechanisms Benefits Benefits This position offers a competitive salary, benefits, and equity.
South San Francisco, CA, USA
Negotiable Salary
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.