
Reinforcement Learning Engineer

Negotiable Salary

Code Metal

Boston, MA, USA


Description

At Code Metal AI, you’ll be part of a world-class team with talent from MIT, OpenAI, and other top companies, focused on pioneering work in large language models (LLMs) and code generation. Our projects directly involve leading chip manufacturers, applying advanced AI to solve meaningful, practical challenges with real-world impact.

This role bridges two critical areas:

Production
Build and maintain robust distributed training systems using PyTorch (2+ years of experience required).
Design and implement scalable data curation and quality assurance pipelines to ensure top-tier training datasets.
Develop orchestration tools that manage complex workflows across large-scale AI model training and evaluation.

Research
Drive innovation by developing evaluation frameworks and reinforcement learning solutions, including recent advancements in Reinforcement Learning from Human Feedback (RLHF).
Engage with frontier research through open-source projects and potential publications, applying RLHF to LLMs, ideally focusing on code generation tasks.

Requirements
2+ years of experience in distributed training, preferably with PyTorch.
Strong background in reinforcement learning, with recent RLHF experience highly preferred.
Proven ability to build data curation and quality assurance pipelines.
Experience with evaluation framework development.
Ideally, experience on both the data pipeline and orchestration sides.
Eligible for TS/SCI clearance.

Nice to have:
Contributions to open-source AI or ML projects.
Published work or demonstrable research experience in related fields.
Hands-on experience applying RLHF to LLMs, especially for code generation.
Experience with large-scale synthetic data generation.

Benefits
Health care plan with 100% premium coverage, including medical, dental, and vision.
401(k) with 5% matching.
Paid time off (uncapped vacation, plus sick leave and public holidays).
Flexible hybrid work arrangement.
Relocation assistance for qualifying employees.

Source: Workable

Location
Boston, MA, USA

© 2025 Servanan International Pte. Ltd.