Web scraping and API scraping are the two most practical data harvesting methods. But what do these two terms mean? What is the difference between them, and what role does each play in data harvesting? The following article defines both and discusses their advantages and disadvantages.
73 posts tagged with "data extraction"
Web Scraping with Deno
Dynamic languages are helpful tools for web scraping. Scripting allows users to rapidly tie together complex systems or libraries and express ideas without dealing with memory management or build systems.
Scrape a Dynamic Website with Python
Turn Any Website Into An API with AutoScraper and FastAPI
In this article, we will learn how to create a simple e-commerce search API with support for multiple platforms: eBay and Amazon. AutoScraper and FastAPI make it possible to build a powerful JSON API for the data. With Playwright's help, we'll extend our scraper and avoid blocking by using ScrapingAnt's web scraping API.
6 Puppeteer Tricks to Avoid Detection and Make Web Scraping Easier
As you know, Puppeteer is a high-level API to control headless Chrome, and it's probably one of the most popular web scraping tools on the Internet. The only problem is that an average web developer might be overloaded by tons of possible settings for a proper web scraping setup.
I want to share 6 handy and pretty obvious tricks that should help web developers increase their scrapers' success rate, improve performance, and avoid bans.
How to use a proxy in Playwright
How to use rotating proxies with Puppeteer
How to use Microsoft Edge with Playwright
Scraping a website with an officially supported browser, or with another real-world browser, helps reduce the risk of the scraper being banned because of its fingerprint or behavioral pattern. Playwright already provides full support for Chromium, Firefox, and WebKit out of the box, without installing the browsers manually. But since most users run Google Chrome or Microsoft Edge rather than the open-source Chromium variant, in some scenarios it's safer to use those browsers to emulate a more realistic environment.
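As a rough illustration of the idea above, here is a minimal sketch using Playwright's Python sync API, where the `channel` launch option selects a branded browser such as Microsoft Edge. The helper names `launch_options` and `fetch_title` are hypothetical and exist only for this example, and the sketch assumes Edge (or Chrome) is already installed on the machine.

```python
from typing import Any, Dict

# Branded browser channels this sketch accepts (an illustrative subset).
# Assumption: the chosen browser is already installed locally.
BRANDED_CHANNELS = {"chrome", "msedge"}


def launch_options(channel: str = "msedge", headless: bool = True) -> Dict[str, Any]:
    """Build keyword arguments for Playwright's chromium.launch() (hypothetical helper)."""
    if channel not in BRANDED_CHANNELS:
        raise ValueError(f"unsupported channel: {channel!r}")
    return {"channel": channel, "headless": headless}


def fetch_title(url: str, channel: str = "msedge") -> str:
    """Open `url` in the branded browser and return the page title (hypothetical helper)."""
    # Imported lazily so the options helper can be used without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        # Edge and Chrome are Chromium-based, so they launch through p.chromium.
        browser = p.chromium.launch(**launch_options(channel))
        try:
            page = browser.new_page()
            page.goto(url)
            return page.title()
        finally:
            browser.close()
```

Calling `fetch_title("https://example.com")` would then drive the installed Edge build instead of the bundled Chromium; passing `channel="chrome"` switches to Google Chrome.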
GPT-2 answers what is Web Scraping
Please don't take this article too seriously.
While playing around with machine learning, we found a pretty interesting white paper about GPT-2. Let's find out what it can generate about web scraping!