Date: Nov 17, 2023

PATCH requests allow partial updates to resources via APIs. Python's requests module makes it easy to send PATCH requests and modify specific attributes using JSON patch docs.

Author: Mohan Ganesan

Date: Oct 31, 2023

The Ultimate HTML::TreeBuilder Cheatsheet in Perl

Author: Mohan Ganesan

Date: Oct 31, 2023

HTML::TreeBuilder is a Perl module for parsing and manipulating HTML and XML documents into a tree structure.

Date: Oct 15, 2023

Learn how to use PHP and the DOM extension to download images from a Wikipedia page and extract data from HTML tables. Use Proxies API for scraping at scale.

Author: Mohan Ganesan

Date: Oct 22, 2023

The Python requests library provides a powerful Session object for handling HTTP requests. Sessions allow you to persist settings, reuse connections, and handle cookies automatically.

Demystifying Authentication with Python Requests

Author: Mohan Ganesan

Date: Oct 22, 2023

Authentication can be tricky when working with APIs and web scraping. Python Requests provides various authentication schemes like basic, token-based, and digest authentication to make it easier. Understand the available auth classes and implement them properly to seamlessly integrate authentication into your Python scripts and apps.

Date: Feb 6, 2024

JavaScript uses urllib library to fetch data from URLs, including JSON APIs, in web browsers and Node.js environments.

Author: Mohan Ganesan

Date: Jan 21, 2024

Making Asynchronous HTTP Requests in Python with aiohttp Connectors

Author: Mohan Ganesan

Date: Feb 22, 2024

The aiohttp library provides a powerful tool for making asynchronous HTTP requests in Python. The aiohttp.TCPConnector manages connection pooling and reuse, allowing for improved performance and optimization of HTTP clients and services.

Automating Web Interactions in Python with Requests

Author: Mohan Ganesan

Date: Feb 3, 2024

Automate web interactions with Python Requests library. Easily submit forms, scrape data, and click buttons programmatically.

Author: Mohan Ganesan

Date: Oct 15, 2023

Making Async HTTP Requests in Python with requests and asyncio

Author: Mohan Ganesan

Date: Feb 3, 2024

Python requests library provides API for HTTP requests. asyncio and aiohttp enable non-blocking requests. grequests uses asyncio for concurrent requests. asyncio is efficient for I/O heavy work.

Author: Mohan Ganesan

Date: Jan 21, 2024

Downloading Images from a Website with VB and HtmlAgilityPack

Author: Mohan Ganesan

Date: Oct 15, 2023

Learn how to use Visual Basic and HtmlAgilityPack to download images from a Wikipedia page and extract data on dog breeds.

CSS Selectors vs XPath with BeautifulSoup: How to Choose the Right Selector

Author: Mohan Ganesan

Date: Oct 6, 2023

CSS selectors and XPath expressions are powerful techniques for parsing and extracting data from HTML and XML. CSS selectors offer simplicity and readability, while XPath provides unmatched query power and flexibility. Combining both can give you a robust toolkit for efficient data extraction.

Date: Feb 3, 2024

Common problems and solutions when sending requests through a proxy server in Python code.

Date: Feb 20, 2024

Understanding URLs is key for web development in Python. URLs have three main components: protocol, domain name, and path. Python provides modules for working with URLs.

Author: Mohan Ganesan

Date: Oct 15, 2023

Making HTTP Requests in Python: requests vs. pycurl

Author: Mohan Ganesan

Date: Feb 3, 2024

Python provides options for making HTTP requests. Use requests library for basic needs and pycurl for more control.

Testing Asynchronous Code with Aiohttp Test Utilities

Author: Mohan Ganesan

Date: Mar 3, 2024

The aiohttp library in Python provides utilities for testing asynchronous code. Use aiohttp.test_utils module to test web APIs and apps.

What are the fastest languages for web scraping?

Author: Mohan Ganesan

Date: Feb 5, 2024

Web scraping involves extracting data from websites. Choosing the right programming language is crucial for scraping large sites. C++ and Rust offer speed, while Go provides simplicity and speed.

Author: Mohan Ganesan

Date: Oct 1, 2023

Making the Most of asyncio: Adding Tasks to Event Loops

Author: Mohan Ganesan

Date: Mar 25, 2024

The asyncio module in Python provides infrastructure for writing asynchronous code using the async/await syntax. The event loop is at the heart of asyncio and manages task execution. Enqueue tasks with loop.create_task() or ensure_future().

Stripping HTML Tags from Text with BeautifulSoup

Author: Mohan Ganesan

Date: Oct 6, 2023

Extract text content from HTML using BeautifulSoup's get_text() method and extract attributes from tags.

Scraping Real Estate Listings from Realtor with CSharp

Author: Mohan Ganesan

Date: Jan 9, 2024

Scrape real estate listing data from Realtor.com using C# and HtmlAgilityPack library. Extract information like broker name, price, beds, baths, sqft, lot size, and address.

Scraping Craigslist Listings with Elixir

Author: Mohan Ganesan

Date: Oct 1, 2023

Author: Mohan Ganesan

Date: Oct 1, 2023

Handling HTTP Response Codes with Python's urllib

Author: Mohan Ganesan

Date: Feb 8, 2024

Check HTTP response codes in Python using urllib. Get the response code and reason phrase to understand the outcome of web requests.

Scraping Multiple Pages in Scala with HTTP Client and XML Libraries

Author: Mohan Ganesan

Date: Oct 15, 2023

Web scraping in Scala using HTTP client and XML libraries to extract data from multiple pages. Use XPath expressions and proxies for scalability.

Date: Oct 1, 2023

Learn how to scrape Craigslist apartment listings using Rust and the reqwest and selectors crates.

SERP APIs That Can Search Google At Scale

Author: Mohan Ganesan

Date: Jan 9, 2024

Author: Mohan Ganesan

Date: Jan 21, 2024

Downloading Images from a Website with Objective-C and Ono

Author: Mohan Ganesan

Date: Oct 15, 2023

Learn how to use Objective-C and AFNetworking and Ono libraries to download images from a Wikipedia page and scrape data.

Scraping Booking.com Property Listings in Ruby in 2023

Author: Mohan Ganesan

Date: Oct 15, 2023

Learn how to scrape property listings from Booking.com using Ruby, Nokogiri, and OpenURI libraries. Use proxies for scaling web scraping.

ProxyScrape Residential Proxies Alternative - Simplify Web Scraping with ProxiesAPI

Author: Mohan Ganesan

Date: Sep 30, 2023

ProxiesAPI simplifies web scraping with a single API call and unlimited bandwidth, beating ProxyScrape's manual proxy rotation and per GB usage fees.

BrightData Alternative - ProxiesAPI for Web Scraping

Author: Mohan Ganesan

Date: Sep 30, 2023

Web scraping made simple with ProxiesAPI, offering automatic proxy rotation, CAPTCHA solving, and javascript rendering. Affordable and easy to use compared to BrightData.

Scraping Hacker News with PHP

Author: Mohan Ganesan

Date: Jan 21, 2024

Troubleshooting 403 Errors: cURL Works but Python Requests Gets Forbidden

Author: Mohan Ganesan

Date: Apr 2, 2024

Requests handles sessions and state differently than cURL - make sure to use Session objects. Check for CSRF middleware that may require tokens. Verify Python code passes through expected authorization headers.

Author: Mohan Ganesan

Date: Oct 1, 2023

Scraping Craigslist Listings with Scala

Author: Mohan Ganesan

Date: Oct 1, 2023

Learn how to scrape Craigslist apartment listings using Scala and the play-ws library. Use XML parsing and a rotating proxy server to avoid IP blocking.

Date: Mar 17, 2024

Python's asyncio module enables asynchronous I/O for improved concurrency. Use asyncio for I/O-bound tasks and when concurrency is needed.

Web Scraping Google Scholar in Objective-C

Date: Jan 9, 2024

Code to extract real estate listing data from Realtor.com for properties in San Francisco using Axios and Cheerio.

Mastering XPath Locators for Reliable Selenium Tests

Author: Mohan Ganesan

Date: Jan 9, 2024

Locators in test automation allow for the identification of elements on a web page. XPath locators are robust and flexible, making them ideal for scalable test automation. By mastering XPath syntax and operators, test engineers can construct dynamic locators to handle complex scenarios. Integrating XPath locators into Selenium scripts requires understanding the difference between finding a single element and multiple elements. Best practices include reusing locators through the Page Object Model pattern and handling exceptions carefully. Troubleshooting XPath issues involves verifying locator accuracy, outputting attribute values, and using more resilient variations. Overall, mastering XPath locators is crucial for successful UI test automation using Selenium.

Scraping Hacker News in Node.js

Author: Mohan Ganesan

Date: Jan 21, 2024

Date: Oct 5, 2023

Encoding URLs in Python with urllib

Author: Mohan Ganesan

Date: Feb 8, 2024

When building web applications in Python, you'll often need to encode URLs and their components to ensure they are valid and can be transmitted properly between the client and server.

Scraping Hacker News with Ruby

Author: Mohan Ganesan

Date: Jan 21, 2024

Author: Mohan Ganesan

Date: Jan 21, 2024

Author: Mohan Ganesan

Date: Jan 21, 2024

Scraping Hacker News Articles with Perl

Author: Mohan Ganesan

Date: Jan 21, 2024

Author: Mohan Ganesan

Web Scraping in Python - The Complete Guide

Web Scraping using ChatGPT - Complete Guide with Examples

The Complete Playwright Cheatsheet

Building a Simple Proxy Rotator with Kotlin and Jsoup

Working with Query Parameters in Python Requests

The Complete BeautifulSoup Cheatsheet with Examples

The Complete Puppeteer Cheatsheet

How to Handle Timeout error in Python requests

Python Requests Cheatsheet

How to fix SSLError in Python requests

Downloading Files with Python Requests - Tips, Tricks and Code Example

Persisting Cookies with Python Requests for Effective Web Scraping

Scrape Any Website with OpenAI Function Calling in Python

The Ultimate Loofah Cheatsheet for Ruby

How to Authenticate with Bearer Tokens in Python Requests

The Ultimate Nokogiri Cheat Sheet for Ruby

Using Proxies with Python Requests

Accessing HTTPS Sites with Self-Signed Certs in Python Requests

Scrape Any Website with OpenAI Function Calling in PHP

The Complete Libxml2 C++ Cheatsheet

How to Build a Simple HTTP Proxy in Rust in just 40 lines

The Complete HTTPBin CheatSheet in Python

Sending Multipart Form Data with Python's urllib

How to Build a Simple HTTP Proxy in CSharp in just 25 lines of code

Uploading Images with Python Requests

The Complete Guide to Retrying Failed Requests with Axios

Accessing Your Local Web Server from Python Requests

Authenticating Python Requests: A Practical Guide to Using Tokens for API Access

How to Build a Super Simple HTTP Proxy in C++ in just 30 lines of code

The Ultimate Select.rs Cheat Sheet for Rust

The Ultimate Cheat Sheet for HtmlAgilityPack in CSharp

Retrying Failed Requests in Python Requests (with Code Examples!)

Expert Techniques for Disabling SSL Certificate Verification in Python Requests

The Ultimate HTML::Parser Perl Cheat Sheet

The Ultimate Goquery Cheatsheet

Caching in Python

Fixing “ModuleNotFoundError: No module named ‘requests’” Error in Python

Fixing the "bytes-like object is required, not 'dict'" Error in Python Requests

Troubleshooting the WinError 10061 with Python Requests

Bypassing Captcha with Selenium and Anti-Captcha Services

Troubleshooting 403 Errors when Web Scraping in Python Requests

The Ultimate Floki Cheatsheet for Elixir

Using Python Requests to Ping an IP Address

How to Build a Super Simple HTTP Proxy in Kotlin in just 20 lines of code

Easy Guide: Installing the Requests Module for Python in VS Code

Web Scraping Websites with Login Example Using Python

Introduction to Web Scraping with BeautifulSoup

A Guide to Using XPath with BeautifulSoup for Powerful Web Scraping

Making Partial Updates with PATCH Requests in Python

Mastering User Agents with Python Requests

The Ultimate Jsoup Cheatsheet in Java

Downloading Files in Python with aiohttp

Accessing URLs Requiring Authentication with Python's urllib

Web Scraping into Excel using ChatGPT

How to fix MissingSchema error in Python requests

Setting the Content-Type Header for Python Requests

Speed Up Slow requests.get() Calls in Python

Making Asynchronous HTTP Requests in Python without Waiting for a Response

How to fix ReadTimeout error in Python requests

The Ultimate Rvest Cheatsheet in R

The Ultimate Goutte Cheat Sheet for PHP

The Ultimate HTML::TreeBuilder Cheatsheet in Perl

Web Scraping in PHP - The Complete Guide

Fetching the Server IP Address with Python Requests

Node Unblocker: The Ultimate Tool for Web Scraping

Web Scraping with Scala & ChatGPT

How to Setup Proxy in Selenium in 2024

Scraping Leads using ChatGPT: A How-To Guide

How to install urllib in Python?

The Ultimate KSoup Cheatsheet for Kotlin

Sending Text Data in a POST Request with Python Requests

How to fix TooManyRedirects error in Python requests

Handling 404 Errors when Making HTTP Requests in Python

Python's URL Handling Libraries compared - urllib vs requests

Web Scraping with PHP & ChatGPT

Understanding HTTP Status Codes with Python Requests

Speeding up Python Requests using gzip and other techniques

A Beginner's Guide to Uploading Files with Python Requests

Web Scraping with Perl & ChatGPT