A Look at Colly: The Golang Framework for Web Scraping

Leverage the fast and elegant framework to extract web content.

Web scraping is a handy tool to have in your arsenal. It is useful in a variety of situations, such as when a website does not provide an API or when you need to parse and extract web content programmatically. This tutorial walks through using the standard library to perform tasks such as making requests, changing headers, setting cookies, using regular expressions, and parsing URLs. It also covers the basics of using the package to scrape information from an HTML web page on the internet.
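
To make the standard-library tasks listed above concrete, here is a minimal sketch in Go. It assumes Go 1.16 or later. The names fetchTitle, resolveLink, and newDemoServer, the User-Agent string, and the "session" cookie are all made up for illustration. A local test server stands in for a real website so the example runs without network access.

```go
package main

import (
	"fmt"
	"io"
	"log"
	"net/http"
	"net/http/httptest"
	"net/url"
	"regexp"
	"strings"
)

// titleRe is a deliberately simple pattern for illustration only;
// regular expressions are fragile on real-world HTML.
var titleRe = regexp.MustCompile(`(?is)<title>(.*?)</title>`)

// fetchTitle (hypothetical helper) requests pageURL with a custom
// User-Agent header and a cookie, then extracts the <title> text.
func fetchTitle(client *http.Client, pageURL string) (string, error) {
	req, err := http.NewRequest(http.MethodGet, pageURL, nil)
	if err != nil {
		return "", err
	}
	req.Header.Set("User-Agent", "example-scraper/0.1")         // change a header
	req.AddCookie(&http.Cookie{Name: "session", Value: "demo"}) // set a cookie

	resp, err := client.Do(req)
	if err != nil {
		return "", err
	}
	defer resp.Body.Close()
	if resp.StatusCode != http.StatusOK {
		return "", fmt.Errorf("unexpected status: %s", resp.Status)
	}
	body, err := io.ReadAll(resp.Body)
	if err != nil {
		return "", err
	}
	m := titleRe.FindSubmatch(body)
	if m == nil {
		return "", fmt.Errorf("no <title> found")
	}
	return strings.TrimSpace(string(m[1])), nil
}

// resolveLink (hypothetical helper) turns a possibly relative href
// into an absolute URL, using base as the page it was found on.
func resolveLink(base, href string) (string, error) {
	b, err := url.Parse(base)
	if err != nil {
		return "", err
	}
	ref, err := url.Parse(href)
	if err != nil {
		return "", err
	}
	return b.ResolveReference(ref).String(), nil
}

// newDemoServer starts a local server that echoes the cookie and
// User-Agent it received into the page title.
func newDemoServer() *httptest.Server {
	return httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		c, err := r.Cookie("session")
		if err != nil {
			http.Error(w, "missing cookie", http.StatusForbidden)
			return
		}
		w.Header().Set("Content-Type", "text/html; charset=utf-8")
		fmt.Fprintf(w, "<html><head><title>cookie=%s ua=%s</title></head></html>", c.Value, r.UserAgent())
	}))
}

func main() {
	srv := newDemoServer()
	defer srv.Close()

	title, err := fetchTitle(srv.Client(), srv.URL)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println("title:", title)

	abs, err := resolveLink("https://example.com/a/b.html", "../c?x=1")
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println("resolved:", abs)
}
```

The regular expression is enough for a single well-known tag, but for anything structural an HTML parser or a framework is usually the safer choice.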

There are many web scraping frameworks for Go. I have chosen Colly because it has many stars on GitHub and makes it easy to traverse to parent, child, and sibling elements.

A Few Pointers Before We Get Started

  1. Make full use of your browser's developer tools. Often, the elements you need will not have usable IDs or class names (e.g. Facebook after its new UI update, where all elements have randomized classes and IDs). Use XPath in these scenarios.
  2. For pages with infinite scrolling, use a scroller component to scroll the whole page before you start scraping, because the elements you need might not be present in the first view of the page and may only load later. A good approach is to scroll once to the end and then start scraping.

