Machine Learning Researcher / Engineer (Foundational Models)

$100,000/year

Pathway

Palo Alto, CA, USA

Description

About Pathway

Pathway is building LiveAI™ systems that think and learn in real time as humans do. Our mission is to deeply understand how and why LLMs work, fundamentally changing the way models think.

The team is made up of AI luminaries. Pathway's CTO, Jan Chorowski, co-authored papers with Geoff Hinton and Yoshua Bengio and was one of the first people to apply attention to speech. Our CSO, Adrian Kosowski, received his PhD in Theoretical Computer Science at the age of 20 and made significant contributions across numerous scientific fields, including AI and quantum information. He also served as a professor and a coach for competitive programmers at Ecole Polytechnique. The team also includes several of the world's top scientists and competitive programmers, alongside seasoned Silicon Valley executives.

Pathway has strong investor backing. To date, we have raised over $15M; our latest reported round was our seed. Our offices are located in Palo Alto, CA, as well as Paris, France, and Wroclaw, Poland.

The Opportunity

This is an R&D position in attention-based models. We are currently searching for 1 or 2 R&D Engineers with a strong track record in machine learning model research. This is an extremely ambitious foundational project. There is a flexible GPU budget associated with this specific project, guaranteed to be at least in the 7-digit range.

You Will

Perform (distributed) model training.
Help improve/adapt model architectures based on experiment results.
Design new tasks and experiments.
Optionally: oversee activities of team members involved in data preparation.

The results of your work will play a crucial role in the success of the project.

Requirements

Cover letter
It's always a pleasure to say hi! If you could leave us 2-3 lines, we'd really appreciate that.

You are expected to meet at least one of the following criteria:

You have published at least one paper at NeurIPS, ICLR, or ICML, where you were the lead author or made significant conceptual & code contributions.
You have significantly contributed to an LLM training effort which became newsworthy (topped a Hugging Face benchmark, best-in-class model, etc.), preferably using multiple GPUs.
You have spent at least 6 months working in a leading Machine Learning research center (e.g. at: Google Brain / DeepMind, Apple, Meta, Anthropic, Nvidia, MILA).
You were an ICPC World Finalist, or an IOI, IMO, or IPhO medalist in High School.

You Are

A deep learning researcher, with a track record in Language Models and/or RL (candidates with a Vision or Robotics ML background are also welcome to apply).
Interested in improving foundational architectures and creating new benchmarks.
Experienced at hands-on experiments and model training (PyTorch, JAX, or TensorFlow).
Someone with a good understanding of GPU architecture, memory design, and communication.
Someone with a good understanding of graph algorithms.
Somewhat familiar with model monitoring, git, build systems, and CI/CD.
Respectful of others.
Fluent in English.

Bonus Points

Knowledge of approaches used in distributed training.
Familiarity with Triton.
Successful track record in algorithms & data science contests.
A code portfolio to show.

Why You Should Apply

Join an intellectually stimulating work environment.
Be a pioneer: you get to work on a new type of "Live AI" challenge around long sequences and changing data.
Be part of an early-stage AI startup that believes in impactful research and foundational changes.

Benefits

Type of contract: Full-time, permanent.
Preferred joining date: Immediate. The positions are open until filled – please apply immediately.
Compensation: six-digit annual salary based on profile and location + Employee Stock Option Plan.
Location: Remote work. Possibility to work or meet with other team members in one of our offices: Palo Alto, CA; Paris, France; or Wroclaw, Poland. Candidates based anywhere in the EU, UK, United States, and Canada will be considered.

If you meet our broad requirements but are missing some experience, don’t hesitate to reach out to us.

Source: Workable

© 2025 Servanan International Pte. Ltd.