
Environmental Data Engineer / Machine Learning Engineer

Negotiable Salary

Grassroots Carbon

San Antonio, TX, USA


Description

Position: Environmental Data Engineer / Machine Learning Engineer
Position Type: Full-Time
Reports to: Jay Weeks, Director of Data & Soil Science

Role Overview

As a Data Engineer / Machine Learning Engineer, you will play a pivotal role in bridging the gap between experimental prototypes and scalable, production-ready systems. You'll spend approximately 50% of your time optimizing and deploying data pipelines to support long-term business needs, and the other 50% developing advanced machine learning models for environmental mapping and for predicting changes in environmental metrics (e.g., soil organic carbon stocks) using time series data. This is a hands-on position in a fast-paced startup environment, where you'll collaborate with cross-functional teams to deliver impactful, reliable solutions.

Key Responsibilities

Production Pipeline Development (50% of time):
- Evaluate and refactor prototype code from R&D phases into efficient, maintainable production pipelines.
- Design, implement, and maintain scalable data ingestion, processing, and ETL (Extract, Transform, Load) workflows using cloud-based infrastructure (e.g., AWS, GCP, or Azure).
- Ensure pipelines are robust, fault-tolerant, and optimized for performance, security, and cost-efficiency.
- Integrate monitoring, logging, and alerting systems to support ongoing operations and quick issue resolution.
- Collaborate with software engineers, scientists, data scientists, and other stakeholders to align pipelines with business objectives, enabling long-term scalability and reliability.

Model Development (50% of time):
- Build, train, and deploy machine learning models for environmental quantification (e.g., digital soil mapping and predicting soil organic carbon stock changes).
- Work with time series data from various sources (e.g., satellite imagery, sensor data, historical records) to develop predictive models using techniques like time series forecasting, geospatial analysis, and deep learning.
- Perform feature engineering, model evaluation, hyperparameter tuning, and validation to ensure accuracy and generalizability.
- Integrate ML models into production environments, including API development for real-time predictions and batch processing.
- Stay abreast of advancements in ML for geospatial and environmental applications, experimenting with new algorithms and tools to improve model performance.

General Duties:
- Conduct code reviews, write documentation, and mentor junior team members on best practices in data engineering and ML.
- Troubleshoot and debug issues in both data pipelines and ML systems.

Required Qualifications
- Bachelor's or Master's degree in Computer Science, Data Science, Machine Learning, Environmental Science, or a related field.
- 3+ years of experience in data engineering, with a proven track record of productionizing prototype code in a startup or fast-paced environment.
- Strong proficiency in programming languages such as Python (with libraries like Pandas, NumPy, Scikit-learn) and SQL.
- Experience with ML frameworks (e.g., TensorFlow, PyTorch, XGBoost) and time series analysis (e.g., Prophet, LSTM networks, PINNs).
- Hands-on experience with cloud platforms, containerization, and CI/CD pipelines.
- Familiarity with geospatial data processing and environmental modeling concepts, particularly in soil science or agriculture.
- Excellent problem-solving skills, with the ability to handle ambiguous requirements and deliver under tight deadlines.

Preferred Skills
- Experience in digital soil mapping, carbon stock prediction models, advanced statistics, and/or Bayesian model calibration and inference.
- Knowledge of big data technologies (e.g., Spark, Kafka) for handling large-scale time series datasets.
- Background in DevOps practices and infrastructure as code (e.g., Terraform).
- Passion for sustainability and environmental impact.

Benefits:
- Health insurance plan with $0 deductible and $0 co-pay
- Dental and vision insurance plans
- Flexible spending account option
- Open Paid Time Off Policy plus 9 paid holidays per year as listed in our Company Handbook
- Participation in our 401(k) savings plan
- Company-paid Life and AD&D coverage
- Educational materials and expenses to support continuing education opportunities

About Grassroots Carbon

Grassroots Carbon is the leading grasslands restoration and soil carbon storage company that partners with landowners to implement and scale regenerative land management practices. In addition to enhancing soil health, promoting biodiversity, and improving water quality, these regenerative practices have tremendous potential to combat climate change by drawing down large quantities of atmospheric CO2 into the soil. Grassroots Carbon is proud to have partnered with ranchers across 1.6 million acres in 21 states to implement practices that restore grasslands, improve bird habitats, build soil health, and drive nature-based soil organic carbon drawdown through healthy soils.

Built on a foundation of scientific rigor, quality, and transparency, Grassroots Carbon has built strong partnerships with Audubon Conservation Ranching, Texas Agricultural Land Trust, Understand Ag, and Colorado State University’s Soil Carbon Solutions Center while generating high-quality soil carbon drawdown credits for leading corporations, including Nestle, Microsoft, Shopify, Marathon Oil, H-E-B, Olipop, and Urban Villages, to offset their carbon impact and reach their sustainability goals.

About Soilworks Natural Capital:

Grassroots Carbon is proud to be a portfolio company of Soilworks Natural Capital, which provides shared services to our fast-growing company. Soilworks is a private equity fund that invests in, incubates, and acquires companies to help accelerate the Regenerative Agriculture movement and is on a mission to prove that regenerative grazing is the most profitable way to ranch. Soilworks' principles include better and healthier food, restoring plant and animal diversity, regenerating soil to store water and carbon, and creating more profitable family farms. Soilworks was launched by the co-founders of Scaleworks, a technology venture equity fund based in San Antonio, TX.

We are proud to foster a workplace free from discrimination. We strongly believe that diversity of experience, perspectives, and background leads to a better environment for our employees and a better experience for our users and our customers. We are an equal-opportunity employer and do not discriminate on the basis of protected characteristics. All candidates will be given the same consideration.

*No visa sponsorship is available for this position*

Source: Workable

Location
San Antonio, TX, USA

© 2025 Servanan International Pte. Ltd.