Browse
···
Log in / Register

Java Developer with Web Crawler Experience

Negotiable Salary

Axiom Software Solutions Limited

Austin, TX, USA

Favourites
Share

Description

Role: Java Developer with Web Crawler Experience Location: Austin TX(Hybrid) Responsibilities: 1. Web Crawler Development: Design and implement efficient and scalable web crawlers in Java to collect data from various online sources. 2. Data Extraction: Develop and maintain systems for structured data extraction, handling various data formats (HTML, JSON, XML, etc.). 3. Data Storage and Processing: Design data storage and processing pipelines, ensuring extracted data is clean, structured, and easily accessible. 4. Performance Optimization: Optimize web crawling processes for speed, efficiency, and accuracy, while ensuring minimal impact on source websites. 5. Error Handling and Logging: Implement error-handling mechanisms and logging systems to detect and resolve issues during crawling operations. 6. Data Integrity and Compliance: Ensure data collection practices are ethical, legal, and compliant with relevant regulations (e.g., robots.txt, copyright laws). Requirements: Proficiency in Java and experience with Java-based web scraping libraries (e.g., Jsoup, Apache HttpClient). Knowledge of web crawling frameworks and tools, such as Scrapy, Selenium, or Puppeteer. Strong understanding of HTML, CSS, JavaScript, and web data structures. Familiarity with data parsing and handling techniques for JSON, XML, and other common formats. Experience with database technologies (SQL, NoSQL) to store and manage scraped data. Knowledge of HTTP protocols, headers, proxies, and load handling.

Source:  workable View original post

Location
Austin, TX, USA
Show map

workable

You may also like

Workable
Numerical Algorithm Software Engineer
SciTec is a dynamic small business, with the mission to deliver advanced sensor data processing technologies and scientific instrumentation capabilities in support of National Security and Defense, and we are growing our creative team! We support customers throughout the Department of Defense and U.S. Government in building innovative new tools to deliver unique world-class data exploitation capabilities. Important Notice: SciTec exclusively works on U.S. government contracts that require U.S. citizenship for all employees. SciTec cannot sponsor or assume sponsorship of employee work visas of any type. Further, U.S. citizenship is a requirement to obtain and keep a security clearance. Applicants that do not meet these requirements will not be considered. SciTec has immediate opportunities for talented software & algorithm developers and engineers to support programs focusing on low-latency data processing, fusion, and tracking algorithms for exploitation of remote sensing systems. Our ideal candidate will work well in multiple software languages as part of a rapid pace, collaborative, small-team environment consisting of Scientists, Engineers, and Developers and be able to prototype and develop advanced algorithms leading to eventual integration in C++ on Linux operating systems as part of government frameworks. Responsibilities Research new algorithms and analysis techniques for remote sensor data exploitation Demonstrate fluent, idiomatic mastery of Python and/or C++; comfort with software design and architecture Develop proof-of-concept signal processing, image processing, and data exploitation tools in Python Characterize quality/performance of algorithms and sensor systems Work as part of an Agile team and contribute to shared tools Other duties as assigned Requirements Colorado Residents: In any materials you submit, you may redact or remove age-identifying information such as age, date of birth, or dates of school attendance or graduation. You will not be penalized for redacting or removing this information. A bachelor’s degree in the physical sciences, mathematics, engineering, or computer science At least five years of ongoing professional experience in defense and/or defense-related technological fields (additional years of education may be substituted for years of experience) Professional experience with state estimation, tracking, or Guidance, Navigation, and Control (GNC) Professional experience and fluency in the following languages: C++, Python Fluency with Linux operating systems Ability to work full-time in-person in Boulder, CO office location Detail oriented with good verbal and written communication skills Candidates who have any of the following skills will be preferred A current active DoD SECRET security clearance or higher An advanced degree in the physical sciences, mathematics, engineering, or computer science Professional experience with application orchestration and/or deployment to the cloud Professional experience with Agile software development Professional experience with the exploitation and analysis of OPIR, E/O, SAR, Spectral, RF, or other remotely sensed data Benefits SciTec offers a highly competitive salary and benefits package, including: Employee Stock Ownership Plan (ESOP) 3% Fully Vested Company 401K Contribution (no employee contribution required) 100% company paid HSA Medical insurance, with a choice of 2 buy-up options 80% company paid Dental insurance 100% company paid Vision insurance 100% company paid Life insurance 100% company paid Long-term Disability insurance 100% company paid Hospital Indemnity insurance Voluntary Accident and Critical Illness insurance Short-term Disability insurance Annual Profit-Sharing Plan Discretionary Performance Bonus Paid Parental Leave Generous Paid Time Off, including Holiday, Vacation, and Sick Pay Flexible Work Hours The pay range for this position is $117,000 - $168,000 / year. SciTec considers several factors when extending an offer of employment, including but not limited to the role and associated responsibilities, a candidate's work experience, education/training, and key skills. This is not a guarantee of compensation. SciTec is proud to be an Equal Opportunity employer. VETS/Disabled.
Boulder, CO, USA
$117,000-168,000/year
Workable
Salesforce Developer II ( remote )
We are seeking a candidate with strong development experience in AGILE projects using Apex, Visualforce and HTML5/JS tools, in an enterprise environment utilizing structured SDLC processes. Good candidates will have the ability to adjust rapidly to a dynamic setting and be able to adopt established development team standards and processes to deliver high quality customer facing web applications. Requirements Responsibilities: Design and develop dynamic, secure, high quality business solutions on the Force.com platform, for the healthcare industry. Create Data Dictionaries Generate application flow charts and technical documentation Defining technical specifications to meet business requirements for custom applications Function in an Agile, structured SDLC team environment Perform unit and integration testing Develop Distributed Integrations between Salesforce and proprietary Enterprise Applications Skills: Salesforce development (minimum 3 years of experience) SOQL (Salesforce Object Query Language) Visualforce/Lightning Components/Aura Components, Apex Proven experience in troubleshooting and solving complex logic problems Desired knowledge on Platform Events Desired knowledge on C#.NET (minimum 3 years of experience) Desired knowledge on .NET Standard/.NET Core Desired knowledge on Team Foundation Server/Azure DevOps Desired knowledge on SFDX Must demonstrate good communication skills Must be highly motivated, proactive, creative and thorough Must be able to thrive in a fast paced, Agile team environment Benefits Supportive, progressive, fast-paced environment Competitive pay structure Matching 401(k) with immediate vesting Medical, dental, vision, life, & short-term disability insurance AssistRx, Inc. is proud to be an Equal Opportunity Employer. All qualified applicants will receive consideration without regard to race, religion, color, sex (including pregnancy, gender identity, and sexual orientation), parental status, national origin, age, disability, family medical history or genetic information, political affiliation, military service, or other non-merit based factors, or any other protected categories protected by federal, state, or local laws. All offers of employment with AssistRx are conditional based on the successful completion of a pre-employment background check. In compliance with federal law, all persons hired will be required to verify identity and eligibility to work in the United States and to complete the required employment eligibility verification document form upon hire. Sponsorship and/or work authorization is not available for this position. AssistRx does not accept unsolicited resumes from search firms or any other vendor services. Any unsolicited resumes will be considered property of AssistRx and no fee will be paid in the event of a hire
Orlando, FL, USA
Negotiable Salary
Cookie
Cookie Settings
Our Apps
Download
Download on the
APP Store
Download
Get it on
Google Play
© 2025 Servanan International Pte. Ltd.