Java Developer with Web Crawler Experience

Negotiable Salary

Axiom Software Solutions Limited

Austin, TX, USA

Description

Role: Java Developer with Web Crawler Experience
Location: Austin, TX (Hybrid)

Responsibilities:
1. Web Crawler Development: Design and implement efficient and scalable web crawlers in Java to collect data from various online sources.
2. Data Extraction: Develop and maintain systems for structured data extraction, handling various data formats (HTML, JSON, XML, etc.).
3. Data Storage and Processing: Design data storage and processing pipelines, ensuring extracted data is clean, structured, and easily accessible.
4. Performance Optimization: Optimize web crawling processes for speed, efficiency, and accuracy, while ensuring minimal impact on source websites.
5. Error Handling and Logging: Implement error-handling mechanisms and logging systems to detect and resolve issues during crawling operations.
6. Data Integrity and Compliance: Ensure data collection practices are ethical, legal, and compliant with relevant regulations (e.g., robots.txt, copyright laws).

Requirements:
Proficiency in Java and experience with Java-based web scraping libraries (e.g., Jsoup, Apache HttpClient).
Knowledge of web crawling frameworks and tools, such as Scrapy, Selenium, or Puppeteer.
Strong understanding of HTML, CSS, JavaScript, and web data structures.
Familiarity with data parsing and handling techniques for JSON, XML, and other common formats.
Experience with database technologies (SQL, NoSQL) to store and manage scraped data.
Knowledge of HTTP protocols, headers, proxies, and load handling.

Source: Workable

© 2025 Servanan International Pte. Ltd.