Apply now for the Web Scraping Engineer position
Web Scraping Engineer
ROLE SUMMARY
Our client is looking for a Web Scraping Engineer who will be navigating websites to extract and save data using the necessary tools to accomplish extracting text data or saving pages to PDFs.
SCHEDULE: 8:00 AM – 5:00 PM Central Daylight Time (9:00 PM – 6:00 AM Philippine Standard Time), follows Philippine holidays
POSITION TYPE: Full Time
WORK ARRANGEMENT: Remote
ESSENTIAL FUNCTIONS
· Create processes to search sites given a set of data to retrieve information
· Save entire result pages to PDF using a standardized file naming convention and insert a link to the file into the database
· Extract certain data fields from the results page and then standardize them to be uploaded into a database for analytics
· Diagnose and resolve any issues that may occur with the automated process developed for scraping as the sites change on occasion
QUALIFICATIONS
· A Bachelor’s degree in the related field is preferred
· Familiarity with techniques used for crawling, extracting, and processing data from websites into PDF files and SQL databases (HTML, Java, and SQL)
· Experience with macro scripting (AutoHotkey and Microsoft Office VBA)
· Knowledgeable in bypassing bot detection techniques (VPN, proxy, behavioral criteria, and request counts)
· Knowledgeable in countering scraping countermeasures
· Strong troubleshooting skills to make needed adjustments
· Analytic skills to develop processes and compare results
· Experience in a multi-client environment
· Strong organization, oral and written communication skills
· Aptitude in data management, analytics, reporting preparation
· Ability to function in an autonomous environment—independent worker, self-directed