← All projects
E-CommerceCanada·2024·5 months
Web Scraper on Google Cloud
Updated and cloud-hosted web scraper that pulls product data from multiple sites and feeds it into Google Sheets automatically.
Overview
Modified an existing scraping script for a Canadian client and deployed it on Google Cloud for scheduled, unattended operation.
Work Done
- Updated target site list (removed old sites, added new ones)
- Fixed broken selectors from site updates
- Added Google Sheets integration to push scraped data automatically
- Deployed on Google Cloud with scheduled execution
- Set up error alerting
Result
Fully automated pipeline running on schedule without manual intervention.
Tech stack
PythonSeleniumGoogle CloudGoogle Sheets API