Posts by: Web Scraper

Dec 092017 0 Responses

Puppeteer – Web Scraping using Headless Chrome Node API

Puppeteer is Headless Chrome browser developed by Google Team. A headless browser is a web browser without a graphical user interface(GUI) means that it has no visual components. Headless browsers enable you to control web page via programming without human intervention. In Programmer’s term, Puppeteer is a node library or API for Headless browsing as well as browser automation developed by Google Chrome team.Browser automation helps you to automate repetitive tasks and web application testing. For example, monitoring product pricing over period of time, form submission, automatically login to web app, perform some task and logout etc.There are many libraries for browser automation and web scraping like PhantomJS, Selenium IDE etc. However Puppeteer runs faster and uses less memory. Puppeteer only works with Google Chrome browser.Puppeteer can be used for:

Read More…

Nov 272017 0 Responses

Data Scraping Studio Review

scraperData Scraping Studio is an integrated and scalable platform which has been built to power your data scraping project. Aim behind this software is to make an automatic and most advanced data extraction engine so the user can enjoy fast data collection experience.Data Scraping Studio is used to extract the data from web pages, ajax sheets, xml, json and many more. It provides many services to the users which has been described in below section. Key services provided by this  are  Expert setup, maintenance, better performance, unlimited users, priority execution and expert support.  For the ease of understanding, Data Scraping Studio provides Help Center including Documentation, Forum, Video tutorials and API documentation. Read More…

Sep 212017 Tagged with , 0 Responses

WebHarvy Review

WebHarvy is a intuitive easy-to-use visual web scraper using which you can automatically scrape text, images, URL’s and emails from websites and you can save the scraped data into various formats. WebHarvy works with all the websites. User can also use WebHarvy for extracting data from product listings, eCommerce websites, yellow pages, real estate listing, social networks, forums etc.

WebHarvey Read More…

Feb 272017 Tagged with , , 0 Responses

Content Grabber V2 – With New Features

Content Grabber – a web scraping software developed by Sequentum is the most advanced web scraping software in the market and it just keeps getting better.

I am excited to highlight several enhancements to Content Grabber features in the Version 2: Chrome based Web browser, Edge Selections, Data Retention options, Change Tracking, Export to single database table, Export script templates, Export settings, File downloads, Screenshots, Retry errors, Simplified Action Configurations, Multiple command selections, Group commands, Improved XPath editor, New self-contained agents. These features better enable web scrapers to extract data from websites efficiently and effectively. Read More…

Jan 022017 Tagged with , , 0 Responses

HtmlAgilityPack to parse HTML in .NET

Introduction

web-scraping-using-htmlagilitypackScreen Scraping also known as Data Scraping or Data Extraction is a technique of collecting different kind of data from a web page like meta tag information, titles, images, links, contact information(phone & email) and other important data like weather forecasts.

To make Web Scraping into action using .NET, we have very useful .NET library known as HTMLAgilityPack. It provides essential methods navigating, modifying and searching DOM(Document Object Model) Tree. HTMLAgilityPack parses anything you give it even if it’s malformed HTML having missing closing tags, very tolerant! It supports XPath and XSLT for navigating the web page. Read More…

Dec 192016 Tagged with , , 0 Responses

Total.js Review

Total.js is a free and very powerful web application framework for building Web sites and Web applications using JavaScript, HTML and CSS. The framework is a server-side framework for Node.js. The framework is written in pure JavaScript.

node framework

Total.js framework has very simple logic, a short learning curve and many features for creating rich and scalable web applications. Read More…

Oct 172016 Tagged with , , , 0 Responses

Octoparse Review – An Automated Web Scraping Tool

octoparse-web-scraperOctoparse is a powerful automated web scraping software with an easy-to-use point-and-click user interface, which enables users to apply different patterns to extract data from different websites with ease.

It provides different advanced functions like Smart Mode, Cloud Extraction, API Access that helps users to capture data from any static or dynamic websites without any programming knowledge. Various export formats are available such as CSV, Excel, HTML, TXT. It also enables users to export extracted data into databases like MySQL, SQL Server, and Oracle. Read More…

Sep 192016 Tagged with , , , 1 Response

Custom Scripting in Content Grabber

Custom Scraping ScriptWhile Content Grabber is very easy to use web scraping software, you shouldn’t make the mistake to think it is not also very flexible and powerful. Part of this flexibility comes from providing developers with a sophisticated scripting capability for controlling a user’s web scraping agent and managing the data being extracted.

Content Grabber provides scripting in different ways to customize Content Grabber behavior based on your specific needs or to extend and enhance standard functionality. Content Grabber scripts are .NET functions written in C# or VB.NET, or regular expressions. Read More…

Jul 212016 Tagged with , , , , 0 Responses

How to find executive email addresses using Rapporative

rapporative chromeWhether you are a young entrepreneur looking to reach out to a CEO or an experienced marketer looking for leads, finding ceo and executive email addresses is difficult and time-consuming. But I have found a neat little trick that can help you get email address contact you need. It has worked for me 95% of the time, whether I am approaching to major press outlets or contacting prominent investors. Read More…

Jul 082016 Tagged with , , , 2 Responses

Xpath Generator – Free tool for making Xpath Expression

xpath generatorXPath is a query language for selecting nodes from HTML or XML document. XPath is used to navigate through elements and attributes in an HTML or XML document. Xpath is inevitable part of web scraping. To extract web element, one must know what is its XPath. Most of the web scrpaing software comes with inbuilt functionality to generate xpath expression easily and some browsers also support facility to inspect XPath but it lacks some advanced functionality. Keeping in mind these limitations, we have made a special tool for XPath Selection named “Xpath Generator”.

Read More…

1 2 3