Skip to main content

9 posts tagged with "python"

View All Tags

· 16 min read
Oleg Kulyk

Python Requests Proxy | How to Use Proxy Types in Python Requests

Python requests are a helpful tool that makes sending HTTP requests easier for Python programs. It simplifies online API connections, website data retrieval, and other web tasks.

Proxy servers are a key part of web scraping, which enables mass data extraction from websites. By utilizing proxies in web scraping with Python requests, you can overcome restrictions, enhance privacy, mitigate IP blocking risks, and effectively gather the data you need for your projects or analysis.

· 13 min read
Oleg Kulyk

Breaking Down IP Restrictions: How to Overcome Website Limits and Gather Data Safely

As the internet grows, I'm finding that many website owners are using IP restrictions to protect their content from unauthorized access. Essentially, IP restrictions limit the requests a user can make to a website within a specific period. Still, they can also pose a challenge for web scrapers like me trying to gather data from the site. In this blog post, I'll explain how IP restrictions work, why they're used, and explore different ways that I can overcome these limitations as a web scraper.

· 8 min read
Oleg Kulyk

A Quick Guide to Parsing HTML with RegEx

Parsing HTML documents can be complex and tedious, but it is an integral part of web development. It is common to parse HTML pages to extract the required information when working with web scraping or website building. One of the methods applied to parse HTML pages is through the use of regular expressions (RegEx).

This guide will walk you through how to parse HTML with RegEx using Python, along with best practices and tips.

· 6 min read
Oleg Kulyk

Jupyter vs Spyder. Which One Is  Better for Python Programming?

Programming has been at the heart of the massive technological developments that we have experienced over the past few decades or so. From data analysis to web development, programming languages such as Python have significantly aided technological innovation. In fact, the position of Python among the pantheon of programming languages is unique.

Most web developers and data scientists prefer to use Python as their primary programming language over other options. And if you’re one such person working with Python or considering doing so, it’s essential to use IDE (Integrated Development Environment) to make programming more efficient and convenient.

· 10 min read
Oleg Kulyk

Scrape a Dynamic Website with Python

Internet extends fast and modern websites pretty often use dynamic content load mechanisms to provide the best user experience. Still, on the other hand, it becomes harder to extract data from such web pages, as it requires the execution of internal Javascript in the page context while scraping. Let's review several conventional techniques that allow data extraction from dynamic websites using Python.

· 4 min read
Oleg Kulyk

Top 5 Popular Python Libraries for Web Scraping in 2020

It is a well-known fact that Python is one of the most popular programming languages for data mining and Web Scraping. There are tons of libraries and niche scrapers around the community, but we’d like to share the 5 most popular of them.

Most of these libraries' advantages can be received by web scraping API and some of these libraries can be used in stack with it.