Browse
···
Log in / Register

Java Developer with Web Crawler Experience

Negotiable Salary

Axiom Software Solutions Limited

Austin, TX, USA

Favourites
Share

Description

Role: Java Developer with Web Crawler Experience Location: Austin TX(Hybrid) Responsibilities: 1. Web Crawler Development: Design and implement efficient and scalable web crawlers in Java to collect data from various online sources. 2. Data Extraction: Develop and maintain systems for structured data extraction, handling various data formats (HTML, JSON, XML, etc.). 3. Data Storage and Processing: Design data storage and processing pipelines, ensuring extracted data is clean, structured, and easily accessible. 4. Performance Optimization: Optimize web crawling processes for speed, efficiency, and accuracy, while ensuring minimal impact on source websites. 5. Error Handling and Logging: Implement error-handling mechanisms and logging systems to detect and resolve issues during crawling operations. 6. Data Integrity and Compliance: Ensure data collection practices are ethical, legal, and compliant with relevant regulations (e.g., robots.txt, copyright laws). Requirements: Proficiency in Java and experience with Java-based web scraping libraries (e.g., Jsoup, Apache HttpClient). Knowledge of web crawling frameworks and tools, such as Scrapy, Selenium, or Puppeteer. Strong understanding of HTML, CSS, JavaScript, and web data structures. Familiarity with data parsing and handling techniques for JSON, XML, and other common formats. Experience with database technologies (SQL, NoSQL) to store and manage scraped data. Knowledge of HTTP protocols, headers, proxies, and load handling.

Source:  workable View original post

Location
Austin, TX, USA
Show map

workable

You may also like

Workable
Machine Learning Engineer, ML Runtime & Optimization
Founded in 2016 in Silicon Valley, Pony.ai has quickly become a global leader in autonomous mobility and is a pioneer in extending autonomous mobility technologies and services at a rapidly expanding footprint of sites around the world. Operating Robotaxi, Robotruck and Personally Owned Vehicles (POV) business units, Pony.ai is an industry leader in the commercialization of autonomous driving and is committed to developing the safest autonomous driving capabilities on a global scale. Pony.ai’s leading position has been recognized, with CNBC ranking Pony.ai #10 on its CNBC Disruptor list of the 50 most innovative and disruptive tech companies of 2022. In June 2023, Pony.ai was recognized on the XPRIZE and Bessemer Venture Partners inaugural “XB100” 2023 list of the world’s top 100 private deep tech companies, ranking #12 globally. As of August 2023, Pony.ai has accumulated nearly 21 million miles of autonomous driving globally. Pony.ai went public at NASDAQ in Nov. 2024. Responsibility The ML Infrastructure team at Pony.ai provides a set of tools to support and automate the lifecycle of the AI workflow, including model development, evaluation, optimization, deployment, and monitoring. As a Machine Learning Engineer in ML Runtime & Optimization, you will be developing technologies to accelerate the training and inferences of the AI models in autonomous driving systems. This includes: Identifying key applications for current and future autonomous driving problems and performing in-depth analysis and optimization to ensure the best possible performance on current and next-generation compute architectures. Collaborating closely with diverse groups in Pony.ai including both hardware and software to optimize and craft core parallel algorithms as well as to influence the next-generation compute platform architecture design and software infrastructure. Apply model optimization and efficient deep learning techniques to models and optimized ML operator libraries. Work across the entire ML framework/compiler stack (e.g.Torch, CUDA and TensorRT), and system-efficient deep learning models. Requirements BS/MS or Ph.D in computer science, electrical engineering or a related discipline. Strong programming skills in C/C++ or Python. Experience on model optimization, quantization or other efficient deep learning techniques Good understanding of hardware performance, regarding CPU or GPU execution model, threads, registers, cache, cost/performance trade-off, etc. Experience with profiling, benchmarking and validating performance for complex computing architectures. Experience in optimizing the utilization of compute resources, identifying and resolving compute and data flow bottlenecks. Strong communication skills and ability to work cross-functionally between software and hardware teams Preferred Qualifications: One or more of the following fields are preferred Experience with parallel programming, ideally CUDA, OpenCL or OpenACC. Experience in computer vision, machine learning and deep learning. Strong knowledge of software design, programming techniques and algorithms. Good knowledge of common deep learning frameworks and libraries. Deep knowledge on system performance, GPU optimization or ML compiler. Compensation and Benefits Base Salary Range: $140,000 - $250,000 Annually Compensation may vary outside of this range depending on many factors, including the candidate’s qualifications, skills, competencies, experience, and location. Base pay is one part of the Total Compensation and this role may be eligible for bonuses/incentives and restricted stock units. Also, we provide the following benefits to the eligible employees: Health Care Plan (Medical, Dental & Vision) Retirement Plan (Traditional and Roth 401k) Life Insurance (Basic, Voluntary & AD&D) Paid Time Off (Vacation & Public Holidays) Family Leave (Maternity, Paternity) Short Term & Long Term Disability Free Food & Snacks
Fremont, CA, USA
$140,000-250,000/year
Workable
Linux Engineer
Resource Management Concepts, Inc. (RMC) provides high-quality, professional services to government and commercial sectors. Our mission is to deliver exceptional management and technology solutions supporting the protection and preservation of the people and environment of the United States of America. RMC is hiring a Linux Engineer in support of our Navy customer in Bethesda, MD.  The selected applicant will:  Patch and STIG Linux Operating to ensure compliance with DoD Information Assurance standards. Provide troubleshooting support for Linux/Windows Operating Systems Perform system updates and server configurations, including upgrades of the Operating System Implement changes to locally hosted workstations/servers Support virtual and physical networking configurations Provide hardware, software, and network troubleshooting Provide RedHat 8, or higher Enterprise administration, including workstations and servers Provide ACAS/Nessus vulnerability and scanning support Support distributed file systems Support Information Security Analyst in implementing and supporting cyber security standards to include NIST and Risk Management Framework (RMF) C&A Standards Document maintenance, repair, and test activities Create and maintain user accounts and install hardware/software Monitor status of LAN/WAN and circuit switching systems Write and maintain automation scripts for RHEL and other operating systems Qualifications: Demonstrated experience configuring and maintaining Linux servers and workstations Demonstrated knowledge and experience supporting Active Directory, Group Policy, and DNS Demonstrated Skills in three or more of the following: Red Hat Linux (RHEL), driver, applications, vulnerabilities, security requirements and postures, quarterly STIG updates, interact with corporate and vendor SMEs to solve complex problems, RMF experience, ACAS scanning, build and maintain Linux Systems Experience documenting trouble reports from STIGs to support computer equipment modifications Requirements Minimum of four (4) years of demonstrated experience administering Linux Systems Administrator. Must possess an IAT II 8140.03 baseline certification (Security+ CE, CCNA Security, CySA+, GICSP, GSEC, CND SSCP) or higher. Must possess Operating System (Linux) training and thereafter maintain the most current training. An active DoD Top Secret clearance is required. Applicant selected may be subject to a security investigation and must meet eligibility requirements for access to classified information. Experience in writing and managing Ansible playbooks, creating automation tasks via Ansible Automation Platform. Experience managing RedHat Satellite Server, including provisioning, package synchronization, and patch management lifecycle. Familiarity with centralized Identity Management solutions. Benefits At RMC, we're committed to your career growth! RMC differentiates itself from other firms through its investment in our employees. We invest our resources to train, certify, educate, and build our employees. RMC can offer you a great place to work with a small company feel and give you the experience, tuition assistance, and certifications that will take your career to the next level. This includes a competitive paid vacation package with 11 paid federal holidays. We also offer high-quality, low-deductible healthcare plans, pet insurance, and a competitive 401K package. Salary at RMC is determined by various factors, including but not limited to location, a candidate's specific combination of education, knowledge, skills, competencies, and experience, as well as contract-specific requirements. The current salary range for this position will be $110,000 to $130,000 (annually). #IND123 #LL-MP1
Bethesda, MD, USA
$110,000-130,000/year
Craigslist
Desktop Support Tech I/II --- Temporary for 4 - 8 weeks (Sacramento)
Desktop Support Technician level I/II (temporary) needed for approximately 4 – 8 weeks. The work is to begin on or around September 29, 2025, and would include visiting various sites for our client, in and around the greater Sacramento, CA area. Mileage will be reimbursed at the federal rate. We are seeking a Desktop Support Technician on a temporary basis (for approximately 4 – 8 weeks) to complete Windows 11 upgrades and perform reimaging on laptops and desktops. This would include setting up users and some end user training as needed as well as possibly replacing hard drives. The hours would primarily be Monday – Friday 8:00 to 5:00 but might require afterhours work but would be rare if any. The candidates selected must have pervious Desktop Support experience in a business environment including: upgrading desktops and laptops to Windows 11, reimaging and end user training. This is an onsite position, and the person selected MUST have reliable personal transportation, be punctual and dependable with excellent professional customer service skills. They must also have a clean state and federal background check. This temporary position (for approximately 4 - 8 weeks) would begin on or about September 29, 2025. The hourly pay rate is $25 – $27 per hour. Candidates who meet the above criteria should attach a resume along with their current availability and desired hourly pay rate.
1029 J St, Sacramento, CA 95814, USA
$25-27/hour
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.