Introduction
Editions & Licensing
What is Web Scraping?
Web Scraping with Content Grabber
Web Scraping Limitations
Web Scraping Techniques
HTML Content
Dynamic Websites
XPath and Selection Techniques
Regular Expressions
Converting Visual Web Ripper Projects
Upgrading to Content Grabber 2
Quick Start with Content Grabber
Installing Content Grabber
Content Grabber Basics
Exploring the Main Window
Building Your First Agent
Choosing a Start URL
Select the Content to Capture
Refine Your Data (optional)
Output Data Format
Test Your Agent
Scheduling
Run Your Agent
Content Grabber Editor
Customizing the Editor Layout
Working With the Web Browser
Command Configuration
Copying or Moving Agent Commands
Selection Tools and Short-cut Keys
Navigating an Agent
Testing/Debugging an Agent
Using the Debugger
Using Logging
Scheduling
Cron Expressions
Windows Task Scheduler
Copying & Moving Agents
Exporting & Importing Agents
Automated Agent Backups
Template Libraries
Default Folders
Selection Techniques
Selection XPaths
Optimizing Selections
List Selections
Selection Anchors
Editing XPaths Manually
Agent Commands
Container Commands
Capture Commands
Web Element Content
Download Image
Download Video
Download Document
Download Screenshot
Download Page
Calculated Value
Page Attribute
Data Value
Action Commands
Agent
Navigate Link
Navigate URL
Navigate Pagination
Crawl Website
Set Form Field
Manual Navigation
Action Configuration
Action
Browser
Events
Wait
Wait for Content
List Commands
Data List
Web Element List
Group Commands
Group
Group in Page Area
Branch Commands
If..ElseIf..Else
Exit or Retry
Other Commands
Execute Script
Remove Duplicate
Transform Page
Refresh Document
Close Browser
Screenshot Log
Composite Commands
Resolve CAPTCHA
Web Forms
Command Library
Error Handling
Agent Error Handling
Error Logs and Notifications
Post Status
Extracting Data From Non-HTML Documents
Crawling a Website
Data
Database Connections
The Internal Database
Using Data Input
Data Providers
Input Parameters
Agent Data
Exporting Data
Selecting an Export Target
Exporting Downloaded Images and Files
Character Encoding
Changing the Default Data Structures
Exporting Data with Scripts
Export From Multiple Agents
Distributing Data
Email Distribution
FTP Distribution
Google Drive Distribution
Dropbox Distribution
Scripting Data Distribution
Removing Duplicate Data
Extracting New Data Only
Reusing Existing Data
Change Tracking
Data Counting
Anonymous Web Scraping
Obeying Robots Rules
CAPTCHA & IP Blocking
CAPTCHA Blocking
IP Blocking & Proxy Servers
Improving Agent Performance and Reliability
Using MS SQL as Internal Database
Using the HTML Parser
Optimizing Agent Commands
Performance Sessions
Multithreading
Building Self-Contained Agents
Customizing the User Interface
Using Input Data
Using a Self-Contained Agent
Running Agents from the Command-Line
Using the Content Grabber runtime
Running Multiple Instances of the Same Agent
Scripting
Script Languages
Script Library
Assembly References
Script Utilities
Script Template Library
Agent Initialization Scripts
Data Export Scripts
Data Distribution Scripts
Content Transformation Scripts
Data Input Scripts
Image OCR Scripts
Convert Document to HTML Scripts
Custom Scripts
Condition Scripts
Command Transformation
Programming Interface
Building a Desktop Application
Visual Studio Configuration
Distributing Your Application
Building a Web Application
Using the Content Grabber Agent Service
Using Simple Web Requests
Sessions
API Access Keys
Website Automation
Website Testing & Documentation