Browse
···
Log in / Register

Test Engineer-AI/LLM

$100,000-200,000/year

OPPO US Research Center

Palo Alto, CA, USA

Favourites
Share

Description

OPPO US Research Center is seeking a full-time meticulous and innovative AI/LLM Test Engineer to join our cutting-edge AI team. In this critical role, you will evaluate the performance, reliability, and safety of Large Language Models (LLMs) in real-world product scenarios and test end-to-end generative AI solutions. Your work will directly shape how users experience AI-powered features by ensuring robustness, accuracy, and alignment with product goals. This is a unique opportunity to pioneer testing methodologies for next-generation AI systems at the forefront of technology. We are also seeking a Contractor based LLM Evaluation & QA Engineer to support the testing and validation of large language model (LLM)-powered applications. You will help implement test strategies, execute evaluation workflows, and assist in model performance validation across diverse generative AI use cases. This contract role is ideal for someone with hands-on experience in AI/ML evaluation, QA engineering, or data analysis who wants to deepen their exposure to generative AI systems. Requirements Full-time position requirement: Core Testing & Evaluation Design and execute performance tests for LLMs across diverse product use cases (e.g., chatbots, content generation etc.). Develop automated test frameworks to evaluate LLM outputs for accuracy, bias, safety, and coherence. Conduct end-to-end testing of integrated generative AI solutions, including APIs, data pipelines, and user interfaces. Optimization & Validation Collaborate with ML engineers to validate fine-tuned models and optimize prompts for target scenarios. Analyze model failures, edge cases, and adversarial inputs to identify risks and improvement areas. Benchmark LLM performance against industry standards and product-specific KPIs. Collaboration & Quality Assurance Partner with product, engineering, and research teams to define test requirements and acceptance criteria. Document defects, performance metrics, and test results to drive data-driven improvements. Advocate for AI ethics and safety through rigorous testing of fairness, bias mitigation, and content moderation. Innovation & Tooling Build scalable tools for synthetic test data generation, prompt variation testing, and automated evaluation workflows. Stay current with advancements in generative AI testing, including red-teaming techniques and evaluation frameworks (e.g., HELM, Dynabench). Propose novel testing strategies for emerging challenges (e.g., hallucinations, context drift). Basic Qualifications: Bachelor’s degree in Computer Science, Data Science, Engineering, or a related technical field, or equivalent practical experience. 1+ years of experience in software testing, data science, or ML validation, with exposure to AI/ML systems. Proficiency in Python and testing frameworks (e.g., PyTest, Selenium). Hands-on experience evaluating LLMs in production environments (e.g., GPT, Claude, Llama, Gemini). Strong analytical skills for dissecting model behavior, statistical performance, and failure modes. Familiarity with cloud platforms (GCP, Azure, or AWS) and MLOps tooling (e.g., MLflow, Weights & Biases). Experience with version control (Git) and agile development methodologies. Preferred Qualifications: Master’s degree in AI, Machine Learning, or a related field. Expertise in prompt engineering, LLM fine-tuning (e.g., LoRA, RLHF), or optimization techniques. Experience with automated evaluation tools (e.g., LangChain, TruLens) or LLM-specific test suites. Knowledge of data pipelines, SQL/NoSQL databases, and API testing (e.g., Postman). Background in statistics, quantitative analysis, or data visualization for test insights. Contributions to AI safety/ethics initiatives or open-source LLM evaluation projects. Experience testing mobile-integrated AI solutions (Android/iOS). Contractor position requirements: Testing & Evaluation Support: Execute pre-defined performance tests for LLMs across various tasks (e.g., summarization, Q&A, chatbot flows). Run scripted evaluations to assess outputs for factuality, coherence, and safety. Perform manual and automated test execution on APIs and LLM-integrated user interfaces. Prompt & model validation: Assist ML engineers in evaluating prompt variations and prompt-tuning outcomes. Log and analyze failure cases, anomalies, and edge cases based on provided guidelines. Collabration & Documentation Work with QA leads, product managers, and ML engineers to understand test goals and criteria. Report defects, compile evaluation summaries, and maintain testing logs. Tooling & Antomation: Use existing internal tools or frameworks to automate test runs and result collection. Contribute to prompt generation, input templating, or result tagging processes. Basic Qualifications: Bachelor's degree or equivalent work experience in a technical field (e.g., Computer Science, Engineering, Data Science). 6+ months experience in software QA, data labeling, LLM evaluation, or ML testing projects. Basic Python proficiency, especially for data processing and automation tasks. Familiarity with LLMs (e.g., GPT, Claude, Gemini) and prompt-based outputs. Comfortable working with tools like Jupyter, Postman, or testing dashboards. Detail-oriented with good documentation habits. Contractor Details: Duration: Long term Rate: Commensurate with experience Conversion Opportunity: High-performing contractors may be considered for full-time roles Benefits OPPO is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. The US base salary range for this full-time position is $100,000-$200,000 + bonus + long term incentives benefits. Our salary ranges are determined by role, level, and location.

Source:  workable View original post

Location
Palo Alto, CA, USA
Show map

workable

You may also like

Workable
Mid-Level Apparel Technical Designer
Miller International, Inc., designer of Cinch® and Cruel®, is searching for its newest team member at our headquarters in Denver, Colorado! Our next Mid-Level Apparel Technical Designer will need to possess excellent team skills and an understanding of garment fit and construction. The successful candidate will be responsible for supporting the entire Product Development Department. The person who fills this position must also have a passion for the work they do and a strong desire to learn and grow. At Miller International, our employees enjoy a fun, casual, laid-back atmosphere. If you have a base amount of Technical Design experience or an educational background, then this is your opportunity to be a part of something great! We want to hear from you if you possess the following skills, abilities, and qualifications: This position is in-office only. Salary: $55,000- $65,000 As a Mid-Level Apparel Technical Designer, you would be responsible for: Posting design sheets onto FTP site and notify vendors for costing. Create and update technical packets in PLM ensuring that all details are commercially viable with the factories and are the most cost effective. Review samples and/or paper patterns from contractors for design accuracy and integrity. Communicate fit comments to overseas and domestic vendors in a timely manner. Execute design and fit intent into bulk production while maintaining quality and cost standards. Lead fit sessions and take initiative as the fit expert of a specific product category or brand. Ensure size and fit consistency within the brand and across product categories. Ensure deadlines are met. Track and manage workflow and workload for own products. Ensure availability of fit models. Build and maintain fit base libraries including sketches and finished garment measurements. Collaborate with product development and design teams on creation of style: from initial specifications and construction detail to fit intent. Foster open communication and team environment with all business partners. Identify and proactively engage business partners when issues arise with recommendations for viable options/solutions. Maintain a positive work atmosphere by acting and communicating in a manner that enables you to get along with customers, clients, co-workers and management. Requirements Bachelor's degree (B. A.) from four-year college or university At least 3 years related experience and/or training Proficient in use of Illustrator required Working knowledge of Photoshop Strong knowledge in patternmaking, including grading, construction, and fit. Ability to analyze quality and maintain standards with contractors. Ability to produce computer generated technical sketches. Ability to multi-task: Use the combination of organization, time management, scheduling and preparation to get multiple tasks completed by the established deadlines. Self-motivated with a strong sense of urgency; strong sense of time awareness. Thorough attention to detail and organizational skills. Technical knowledge of fabrics, finishes, trims, and techniques. Excellent interpersonal, verbal, and written communication skills. Creative approach to problem solving. Team-oriented, proactive attituded. Other tasks as assigned. Benefits Interested yet? Miller International offers spectacular benefits to ensure its employees are happy and healthy, and the Company firmly believes in the importance of maintaining a proper work-life balance. If this sounds like a position you genuinely want to fill, send us your resume, portfolio, and cover letter telling us about yourself and why you would like to work with us. Out-of-state candidates are welcome to apply, as long as you are willing to relocate to Denver, Colorado. Our success lies in the hands of our dedicated and loyal staff – and we only employ the best! We pride ourselves on a rich history of over 100 years in the making that embraces the tradition of hard work, distinction and providing unsurpassed quality products to our customers. Since 1918 Miller International has matured and consistently evolved to become what it is today: One of the most successful privately owned Companies in the Western Industry whose brands continue to gain impressive popularity and growth. We do it by treating each other with respect, and we do it all as a team that feels more like a family. We at Miller are guided by our Core Values and use them to measure the appropriateness of decisions, whether it be with vendors, customers or employees. The Core Values were created and approved by our employees as an affirmation that they are willing to be part of a Company that is guided by these principles. We can’t wait to hear from you! Check us out at: www.miller-international.com Application Deadline: 9/30/2025
Denver, CO, USA
$55,000-65,000/year
Workable
Product Manager
Well, hello there 👋 Screencastify is a leading educational technology company dedicated to improving communication and learning outcomes with video. Our primary focus is on the K-12 education sector in the United States and we are critical in helping scale a teacher and improve student outcomes all while being an easy to use solution. Screencastify is used by over 15M people and is seeking a dynamic and results-oriented Product Manager to join our Squad! About this role  We built the simplest and most reliable screen recorder in the world, but that’s only the beginning. Our near future is full of ambitious new goals, features, and products that will enable us to further improve how we provide service to our users and accelerate our already fast growth. ​  At Screencastify, we empower our Product team to solve complex customer and business challenges in ways that delight our users and drive success. As an experienced Product Manager joining our cross-functional team, you will collaborate with the team to discover, define and deliver innovative features that directly impact teachers, educators, students, and companies around the world. Your expertise in SaaS, and ability to thrive in a fast-paced start-up environment, will be essential as you champion user needs and drive product initiatives that enhance our offerings and elevate the learning experience.  Why is this role special? Have a massive impact. As part of a small Product team you will own a specific suite of products and be instrumental in the design of features and products that will be seen and used by millions of people. Work for our users. Above all else, you will be an advocate for our users and will get to know their voices and stories better than most people in the company. Join us and be a critical part of our growth story.We're bootstrapped, profitable, and support tens of millions of users, which gives us a huge green field to work with. You'll join at the perfect time to shape how we grow from here. What you'll do: AI First: Implement in the product and streamline processes, enhance operational efficiency and drive business outcomes. Gather insights about customer needs through a combination of qualitative and quantitative research. Collaborate with cross-functional teams including Product Designers and Tech teams to translate customer insights into feature ideas that are technically feasible, aligned with business goals, and usable by customers. Drive product adoption through direct user outreach and training customer facing-teams.  Frequently test prototypes and feature ideas to eliminate  committing to production quality versions. Lead cross-departmental feature release teams to successfully bring your team’s work to market. Work with company leaders to define annual and quarterly OKRs. Monitor and report  progress on  product initiatives. Stay up-to-date on the trends and forces shaping the K-12 market and beyond. Requirements You're perfect for this role if you: Bring 4+ years of hands-on experience in product management, specifically with SaaS products, demonstrating a track record of delivering impactful solutions.  Are comfortable building products from 0 to 1, with a deep hands-on approach. Have experience in the Edtech industry, which is a plus. Possess a solid understanding of product discovery and delivery methods. Have a proven ability to collaborate with engineers, designers, sales, marketing, support and company leaders. Thrive in the fast-paced, ever-changing environment of a start-up. Are flexible, dedicated, and continuously curious. Working at Screencastify At Screencastify, we are results focused and here to improve communication, teaching and learning globally. This isn’t an easy feat but it is important for our future. We value accountability, commitment, and speed. We take our responsibility to our customers very seriously, so when we miss a deadline or slow down, it matters.  We’re a competitive culture and strive for speed and innovation. We are problem solvers, don’t point fingers and rather enjoy working together to bring solutions to the forefront. Join a company that has millions of users, a strong brand all by being very entrepreneurial and embodying the start up mindset. We love a challenge and pushing the world forward with creativity, ingenuity and out of the box thinking. People are everything and we want to work in a company of deeply good people who treat their colleagues exceptionally well. Rule #1: Be a good person. This is a Chicago-based hybrid position with 3 days a week in the office. Compensation The expected annual base salary for this role is anticipated to start at $100,000. Final compensation may vary based on experience and qualifications. Benefits Competitive Compensation. We take a data-driven approach to our compensation strategy so all employees are paid competitively and fairly. 401(k) & Annual Performance Bonus Opportunity. We want to invest in present you and future you, which is why we offer a 401(k) match + Annual Performance Bonus opportunity. Flexible Time Off (FTO) Policy. We recognize that time off to rest and recharge is important. The Flexible Time Off Policy (FTO) is designed for our employees to do just that -- balance work and life while maintaining well-being. Parental Leave. Generous paid time off for parents to bond with the newest addition to their family! Medical, Dental, & Vision Insurance. We offer comprehensive health benefits, including medical, dental, and vision insurance. Plus, all employees receive a free One Medical membership. Divvy Bike Membership. If you’re in Chicago, take advantage of an annual Divvy membership -- on us. At Screencastify, we foster an inclusive, supportive, fun, and challenging team environment. We value having a team that is made up of a diverse set of backgrounds and respect the healthy expression of diverse opinions. We embrace experimentation and the examination of all kinds of ideas through reasoning and testing. Come join us as we continue to change the world through video. Screencastify is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, sexual orientation, national origin, age, genetic information, gender identity, disability, Veteran status or any other characteristic protected by federal, state or local law.
Chicago, IL, USA
$100,000/year
Workable
Production Supervisor
Parallel Employment is on the lookout for a skilled and dedicated 2nd Shift Production Supervisor for a food manufacturing client in Forestville, NY . In this role, you will be responsible for overseeing the daily operations of the production floor, ensuring that production goals are met while maintaining high standards of quality and safety. This position offers an exciting opportunity to lead a team and drive efficiency in our manufacturing processes. Pay starts from $23-$29/HR depending on experience. Starting time range from 2PM-5PM. Key Responsibilities: Supervise and coordinate the daily activities of the production team. When production finishes early production staff transitions to sanitation - inspect work to ensure proper cleaning/sanitizing Operate juice processing line Ensure adherence to production schedules and maintain optimal production levels. Enforce safety regulations and promote a safe working environment. Conduct regular assessments of production quality and implement improvements as necessary. Provide training and development for employees to enhance their skills and performance. Collaborate with other departments to facilitate smooth operations and address any production-related issues. Prepare and maintain accurate production reports and documentation. Requirements Minimum of 2-3 years of experience in a supervisory role within a manufacturing environment. Strong leadership and team-building skills. 8 - 10 hour shifts Excellent communication and interpersonal abilities. Proficient in Microsoft Office Suite and production management software. Ability to work well under pressure and manage multiple tasks. High school diploma or equivalent; further education in management or a related field is preferred. Benefits Equal Opportunity Employer #ind456
Forestville, NY 14062, USA
$23/hour
Craigslist
Production Assistant – Ruby’s Dragons (petaluma)
Production Assistant – Ruby’s Dragons (Petaluma, CA) Part-time | Flexible hours | Starting at $18–$22/hour (DOE) Do you love hands-on work, organization, and helping bring magic to life?
 Ruby’s Dragons is a small, family-run 3D printing company that creates colorful articulated dragons, tiny animals, and collectible toys. We’re looking for a reliable, detail-oriented Production Assistant to help us keep our growing workshop running smoothly. You’ll help with: * Finishing 3D prints using a heat gun and removing supports * Sorting completed prints into bins by style and color * Maintaining organized inventory * Cleaning build plates and keeping the workspace tidy (sweeping, trash, etc.) * Starting prints based on what’s low in stock (using a simple system/checklist) * Counting, packing, and shipping orders * Troubleshooting basic printer issues and flagging when something’s off * Organizing filament and keeping supplies stocked Bonus points if you can (or are excited to learn to): * Do basic printer maintenance (changing nozzles, clearing jams, etc.) You’re a great fit if you: * Are dependable and take pride in your work * Enjoy working with your hands and have good attention to detail * Like organizing and keeping things clean * Are comfortable with technology and willing to learn new systems * Can lift 20 lbs and stand for stretches of time Hours: Flexible, 10–20 hours per week to start (Mon–Fri daytime preferred)
 Location: Our Petaluma home workshop
 Pay: $18–$22/hour depending on experience If you’re ready to join a fun, creative small business where every dragon matters, send us a quick email introducing yourself and why this sounds like a good fit. Experience with 3D printing or small business production is a plus, but not required — we’ll train the right person! To apply: 
Email with the subject “Production Assistant Application – [Your Name]”
 Please include:
 ✅ A short introduction about you 
✅ Your availability
 ✅ Any relevant experience
1524 McGregor Ave, Petaluma, CA 94954, USA
$18-22/hour
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.