
Test Engineer-AI/LLM

$100,000-200,000

OPPO US Research Center

Palo Alto, CA, USA


Description

OPPO US Research Center is seeking a meticulous and innovative full-time AI/LLM Test Engineer to join our cutting-edge AI team. In this critical role, you will evaluate the performance, reliability, and safety of Large Language Models (LLMs) in real-world product scenarios and test end-to-end generative AI solutions. Your work will directly shape how users experience AI-powered features by ensuring robustness, accuracy, and alignment with product goals. This is a unique opportunity to pioneer testing methodologies for next-generation AI systems at the forefront of technology.

We are also seeking a contractor-based LLM Evaluation & QA Engineer to support the testing and validation of large language model (LLM)-powered applications. You will help implement test strategies, execute evaluation workflows, and assist in model performance validation across diverse generative AI use cases. This contract role is ideal for someone with hands-on experience in AI/ML evaluation, QA engineering, or data analysis who wants to deepen their exposure to generative AI systems.

Requirements

Full-time position requirements:

Core Testing & Evaluation
· Design and execute performance tests for LLMs across diverse product use cases (e.g., chatbots, content generation).
· Develop automated test frameworks to evaluate LLM outputs for accuracy, bias, safety, and coherence.
· Conduct end-to-end testing of integrated generative AI solutions, including APIs, data pipelines, and user interfaces.

Optimization & Validation
· Collaborate with ML engineers to validate fine-tuned models and optimize prompts for target scenarios.
· Analyze model failures, edge cases, and adversarial inputs to identify risks and improvement areas.
· Benchmark LLM performance against industry standards and product-specific KPIs.

Collaboration & Quality Assurance
· Partner with product, engineering, and research teams to define test requirements and acceptance criteria.
· Document defects, performance metrics, and test results to drive data-driven improvements.
· Advocate for AI ethics and safety through rigorous testing of fairness, bias mitigation, and content moderation.

Innovation & Tooling
· Build scalable tools for synthetic test data generation, prompt variation testing, and automated evaluation workflows.
· Stay current with advancements in generative AI testing, including red-teaming techniques and evaluation frameworks (e.g., HELM, Dynabench).
· Propose novel testing strategies for emerging challenges (e.g., hallucinations, context drift).

Basic Qualifications:
· Bachelor’s degree in Computer Science, Data Science, Engineering, or a related technical field, or equivalent practical experience.
· 1+ years of experience in software testing, data science, or ML validation, with exposure to AI/ML systems.
· Proficiency in Python and testing frameworks (e.g., PyTest, Selenium).
· Hands-on experience evaluating LLMs in production environments (e.g., GPT, Claude, Llama, Gemini).
· Strong analytical skills for dissecting model behavior, statistical performance, and failure modes.
· Familiarity with cloud platforms (GCP, Azure, or AWS) and MLOps tooling (e.g., MLflow, Weights & Biases).
· Experience with version control (Git) and agile development methodologies.

Preferred Qualifications:
· Master’s degree in AI, Machine Learning, or a related field.
· Expertise in prompt engineering, LLM fine-tuning (e.g., LoRA, RLHF), or optimization techniques.
· Experience with automated evaluation tools (e.g., LangChain, TruLens) or LLM-specific test suites.
· Knowledge of data pipelines, SQL/NoSQL databases, and API testing (e.g., Postman).
· Background in statistics, quantitative analysis, or data visualization for test insights.
· Contributions to AI safety/ethics initiatives or open-source LLM evaluation projects.
· Experience testing mobile-integrated AI solutions (Android/iOS).

Contractor position requirements:

Testing & Evaluation Support
· Execute pre-defined performance tests for LLMs across various tasks (e.g., summarization, Q&A, chatbot flows).
· Run scripted evaluations to assess outputs for factuality, coherence, and safety.
· Perform manual and automated test execution on APIs and LLM-integrated user interfaces.

Prompt & Model Validation
· Assist ML engineers in evaluating prompt variations and prompt-tuning outcomes.
· Log and analyze failure cases, anomalies, and edge cases based on provided guidelines.

Collaboration & Documentation
· Work with QA leads, product managers, and ML engineers to understand test goals and criteria.
· Report defects, compile evaluation summaries, and maintain testing logs.

Tooling & Automation
· Use existing internal tools or frameworks to automate test runs and result collection.
· Contribute to prompt generation, input templating, or result tagging processes.

Basic Qualifications:
· Bachelor's degree or equivalent work experience in a technical field (e.g., Computer Science, Engineering, Data Science).
· 6+ months of experience in software QA, data labeling, LLM evaluation, or ML testing projects.
· Basic Python proficiency, especially for data processing and automation tasks.
· Familiarity with LLMs (e.g., GPT, Claude, Gemini) and prompt-based outputs.
· Comfortable working with tools like Jupyter, Postman, or testing dashboards.
· Detail-oriented with good documentation habits.

Contractor Details:
· Duration: Long term
· Rate: Commensurate with experience
· Conversion Opportunity: High-performing contractors may be considered for full-time roles

Benefits

OPPO is proud to be an equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements.
The US base salary range for this full-time position is $100,000-$200,000, plus bonus and long-term incentive benefits. Our salary ranges are determined by role, level, and location.

Source: workable


© 2025 Servanan International Pte. Ltd.