
82 posts tagged with "web scraping"


· 8 min read
Oleg Kulyk

A Quick Guide to Parsing HTML with RegEx

Parsing HTML documents can be complex and tedious, but it is an integral part of web development. When scraping the web or building websites, you often need to parse HTML pages to extract specific information. One way to do that is with regular expressions (RegEx).

This guide will walk you through how to parse HTML with RegEx using Python, along with best practices and tips.
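As a small taste of the approach, here is a minimal sketch that pulls link targets and link text out of a short HTML string with Python's standard `re` module. The sample markup and the `LINK_RE` name are assumptions made up for illustration, and the pattern only handles simple, well-formed anchor tags with a double-quoted `href` as the first attribute.

```python
import re

# Hypothetical HTML snippet, used only for illustration.
html = '<ul><li><a href="/a">First</a></li><li><a href="/b">Second</a></li></ul>'

# Capture the href value and the link text of simple anchor tags.
# The non-greedy (.*?) stops at the first closing </a>.
LINK_RE = re.compile(r'<a\s+href="([^"]*)"[^>]*>(.*?)</a>', re.IGNORECASE | re.DOTALL)

links = LINK_RE.findall(html)
print(links)  # [('/a', 'First'), ('/b', 'Second')]
```

A pattern like this tends to break on nested tags, single-quoted attributes, or unusual attribute order, so it is best suited to small, predictable fragments.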

· 9 min read
Oleg Kulyk

Puppeteer Debugging and Troubleshooting - Best Practices

Puppeteer is a powerful tool for automating web testing and scraping. However, it is still subject to problems and bugs like any other software.

It's crucial to have a well-thought-out plan in place for solving issues when they arise.

In this post, we'll explore some of the best practices for debugging and troubleshooting Puppeteer.

· 6 min read
Oleg Kulyk

Becoming a Web Scraper - Scraping as Google Crawler for Maximum Results

Are you looking to become a web scraper? While web scraping can seem daunting, it doesn’t have to be. In this post, we’ll discuss what web scraping is, how pretending to be a Google crawler can help you get the most out of it, and how to use web scrapers for maximum results. So get ready, because you’re about to learn the ins and outs of web scraping.

· 12 min read
Oleg Kulyk

Avoid Cloudflare with These 5 Proven Methods

Before we begin, it's essential to understand that data extraction and web scraping are legal gray areas. Some consider these practices highly unethical, if not outright illegal, so pay attention to what you're scraping and how you're using it. Scraping personal data, gathering information without permission, or collecting copyrighted material (among other things) may be illegal. Be careful about what you collect and what you do with it once you have it. This information can make a big difference for your business, but if you don't use it correctly, it could cause you problems.

· 9 min read
Oleg Kulyk

Web Scraping vs Web Crawling: Use Cases and Differences

Web scraping and web crawling are two different yet related approaches to gathering data from the internet.

Web scraping is the process of extracting specific pieces of information from a website, while web crawling uses an automated bot to systematically browse the World Wide Web, analyzing and indexing large amounts of data about the pages it visits.
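To make the distinction concrete, here is a minimal sketch that runs entirely against an in-memory "site", so it makes no network requests. The `PAGES` dictionary and the `scrape_title` and `crawl` helpers are hypothetical names invented for this illustration: `scrape_title` extracts one specific field from one page, while `crawl` follows links from page to page and builds an index.

```python
import re
from collections import deque

# Hypothetical in-memory site: URL path -> HTML. No real requests are made.
PAGES = {
    "/": '<title>Home</title><a href="/about">About</a><a href="/blog">Blog</a>',
    "/about": '<title>About</title><a href="/">Home</a>',
    "/blog": '<title>Blog</title><a href="/about">About</a><a href="/missing">Gone</a>',
}

def scrape_title(html):
    """Scraping: extract one specific piece of information from a page."""
    match = re.search(r"<title>(.*?)</title>", html)
    return match.group(1) if match else None

def crawl(start):
    """Crawling: follow links breadth-first and index every reachable page."""
    seen, queue, index = {start}, deque([start]), {}
    while queue:
        url = queue.popleft()
        html = PAGES.get(url)
        if html is None:  # Link points to a page that does not exist; skip it.
            continue
        index[url] = scrape_title(html)
        for link in re.findall(r'href="([^"]*)"', html):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return index

print(scrape_title(PAGES["/blog"]))  # Blog
print(crawl("/"))  # {'/': 'Home', '/about': 'About', '/blog': 'Blog'}
```

In practice a crawler usually calls a scraper on each page it visits, which is why the two terms are so often confused.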

· 9 min read
Oleg Kulyk

Puppeteer Vs. Selenium: Which Is Better?

As internet use grows worldwide, it reaches into every aspect of our daily lives. Using it efficiently and effectively becomes crucial and can set a business apart from its competitors. This is where web automation comes in. Today I'll walk you through one of the most debated topics in web automation: Puppeteer vs. Selenium.

Let's begin!