Create new Content Grabber agents
Edit existing Content Grabber agents
(Limited functionality in Server edition)
Export Content Grabber agents
Import Content Grabber agents
(Server edition cannot upgrade Trial agents to full agents)
Easy point and click configuration
Simply click on web elements to configure website navigation and content capture.
Extract complete content structures
Extract complete online product catalogues, search results, job boards, etc.
Create lists of web elements
Follow lists of links or extract lists of content elements.
Follow any type of pagination to extract all pages from search results.
Handle website logins
Log in to access and scrape secure websites (e.g. social media, web-based email, intranet sites).
Submit all types of web forms
Automatically submit web forms repeatedly for all possible form field input values.
Control processing flow
Control processing flow with Branch commands (if..then..else) and conditional Exit and Retry commands.
Capture any content from a website
Capture text, links, images, files, meta tags, videos and much more.
Capture screenshots of web elements or entire web pages.
Capture web pages as PDF or HTML
Capture entire web pages as PDF or HTML documents.
Capture hidden content
Capture text from HTML source code that is hidden from view.
Refine extracted content
Refine content with simple click-and-highlight operations, or use regular expressions for more complex cases.
Track changes on a website
Track changes on a website, including deleted content, added content and updated content.
Crawl entire websites
Repeatedly follow all links on a website to look for specific content, such as emails or phone numbers.
Use input data from almost any data source
Input data can provide lists of start URLs, form field values, etc.
Scroll to load dynamic data
Automatically scroll down a page when required to load and extract dynamically loaded content.
Extract data from the most complex dynamic websites.
Capture files generated on-the-fly
Capture files, such as reports, that are generated on-the-fly as a result of an action.
Intelligent selection engine
Construct robust XPath selections that keep working even when a website changes slightly.
Full XPath 1.0 support
Selection engine with XPath 1.0 support and some extended functionality.
Convert images to text
Resolve CAPTCHAs and other images using integrated third-party OCR/CAPTCHA services.
Extract data from non-HTML documents
Extract data from document types such as PDF or Excel by using third-party document converters. We provide converters for PDF, Excel, CSV and DOCX.
Web browser activity monitor
Monitor web browser activity to help optimize page loads and other actions.
Web request editor
Use the Web Request Editor in combination with the Web Browser Activity Monitor to develop high-performance agents that extract data directly from the API endpoints used by a website, which is much faster than downloading whole web pages.
Visual and detailed view of web pages
Customizable user interface
Easily explore agent commands and web pages loaded by an agent.
Copy and paste commands, or drag and drop commands within an agent.
Create images from website
Save screenshot images, PDFs or HTML for all types of web pages an agent has been configured to visit. This feature can be used to check if a target website has changed substantially since an agent was developed.
Command template library
Access commonly used sets of commands.
Agent template library
Pre-configured agents for commonly scraped websites.
Custom template libraries
Add your own commands and agents to template libraries for later reuse.
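
To illustrate what the list above means by a robust XPath selection and by refining extracted content with regular expressions, here is a minimal Python sketch. It does not use Content Grabber or its selection engine. It relies on Python's standard xml.etree.ElementTree module, which implements only a subset of XPath 1.0. The page markup, the class names and the extract_price helper are all hypothetical and exist only for this example.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical page fragment; markup and class names are invented for illustration.
PAGE_V1 = """
<html><body>
  <div class="banner">Sale!</div>
  <div class="product">
    <span class="name">Widget</span>
    <span class="price">Price: $1,299.50 (incl. tax)</span>
  </div>
</body></html>
""".strip()

# The same page after a small layout change: the banner div has been removed.
PAGE_V2 = """
<html><body>
  <div class="product">
    <span class="name">Widget</span>
    <span class="price">Price: $1,299.50 (incl. tax)</span>
  </div>
</body></html>
""".strip()

# Position-based path: depends on the product being the second div in the body.
BRITTLE = "./body/div[2]/span[2]"
# Attribute-based path: depends only on the class attributes of the target elements.
ROBUST = ".//div[@class='product']/span[@class='price']"

# Refinement step: keep only the numeric part that follows the dollar sign.
PRICE_RE = re.compile(r"\$([\d,]+\.\d{2})")


def extract_price(page, xpath):
    """Select one element with the given path and return its price as a float.

    Returns None if the path matches nothing or the text contains no price.
    """
    root = ET.fromstring(page)
    node = root.find(xpath)
    if node is None or node.text is None:
        return None
    match = PRICE_RE.search(node.text)
    if match is None:
        return None
    return float(match.group(1).replace(",", ""))


if __name__ == "__main__":
    for label, page in (("v1", PAGE_V1), ("v2", PAGE_V2)):
        print(label, "brittle:", extract_price(page, BRITTLE))
        print(label, "robust: ", extract_price(page, ROBUST))
```

In this sketch the position-based path finds the price only on the first version of the page and returns None once the banner is removed, while the attribute-based path returns 1299.5 for both versions. A real selection engine would handle far more cases than this.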