Create new Content Grabber agents
Edit existing Content Grabber agents
(Limited functionality in Server edition)
Export Content Grabber agents
Import Content Grabber agents
Easy point and click configuration
ExpandSimply click on web elements to configure website navigation and content capture.
Extract complete content structures
ExpandExtract complete online product catalogues, search results, job boards, etc.
Create lists of web elements
ExpandFollow lists of links or extract lists of content elements.
ExpandFollow any type of pagination to extract all pages from search results.
Handle website logins
ExpandLogin to access and scrape secure web sites (e.g. social media, web email, intranet sites).
Submit all types of web forms
ExpandAutomatically submit web forms repeatedly for all possible form field input values.
Capture any content from a website
ExpandCapture text, links, images, files, meta tags, videos and much more.
ExpandCapture screenshots of web elements or entire web pages.
Capture hidden content
ExpandCapture text from HTML source code that is hidden from view.
Refine extracted content
ExpandRefine content with simple click and highlight operations, or use more complex Regular Expressions.
Crawl entire websites
ExpandRepeatedly follow all links on a website to look for specific content, such as emails or phone numbers.
Use input data from almost any data source
ExpandInput data can provide lists of start URLs, form field values, etc.
Scroll to load dynamic data
ExpandAutomatically scroll down a page when required to load and extract dynamically loaded content.
ExpandExtract data from the most complex dynamic websites.
Capture files generated on-the-fly
ExpandCapture files, such as reports, that are generated on-the-fly as a result of an action.
Intelligent selection engine
ExpandConstruct robust selection XPATHs that work even when a website changes slightly.
Full XPATH v1.0 support
ExpandSelection engine with XPATH 1.0 support and some extended functionality.
Convert images to text
ExpandResolve CAPTCHAs and other images using integrated 3rd party OCR/CAPTCHA services.
Extract data from non-HTML documents
ExpandExtract data from document types such as PDF or Docx by using 3rd party document converters.
Web browser activity monitor
ExpandMonitor web browser activity to help optimize page loads and other actions.
Visual and detailed view of web pages
Customizable user interface
ExpandEasily explore agent commands and web pages loaded by an agent.
ExpandCopy and paste commands, or drag and drop commands within an agent.
Command template library
ExpandAccess commonly used sets of commands.
Agent template library
ExpandPre-configured agents for commonly scraped websites.
Custom template libraries
ExpandAdd your own commands and agents to template libraries for later reuse.