All Articles

How to Parse Addresses using Python and Google GeoCoding API

How to Parse Addresses using Python and Google GeoCoding API

Web scraping can often lead to you having scraped address data which are unstructured. If you have come across a large number of freeform address as a single string, for example – “9 Downing St Westminster London SW1A, UK”,  you know how hard it would be to validate, compare and deduplicate these addresses. To start […]

How to Scrape Expedia using Python and LXML

How to Scrape Expedia using Python and LXML

Learn how to scrape flight details from Expedia.com, a leading travel and hotel site, using Python 3 and LXML in this web scraping tutorial. You’ll learn how to extract flight details such as flight timings, plane names, flight duration and more for a given source and destination.

Amazon vs Walmart- Products Sold in March 2017

Amazon vs Walmart- Products Sold in March 2017

As of March 2017, Walmart has a total of 22 million products on sale and Amazon has 375 million products. Walmart has only 5.8% of what Amazon has to offer. Competition with Amazon Walmart has seen a quantum leap with e-commerce sales picking up. With Walmart’s earnings in the fourth quarter seeming to be a definite blow to Amazon, […]

Number of Products sold on Amazon.com- March 2017

Number of Products sold on Amazon.com- March 2017

Amazon has a total of 337,173,768 products as on March 7th, 2017 That’s 7% less than February 2017. Amazon had 362 million products on February 4th, 2017. Top 10 Categories   Just like the preceding month, Digital Music category leads with 66.3 million products. The Home & Kitchen category has overtaken Electronics by 2.1 million products […]

Amazon vs Walmart – Products sold in February 2017

Amazon vs Walmart – Products sold in February 2017

As of February 2017, Walmart has a total of about 20 Million products on sale and Amazon has 371 million products. Walmart still has only 5.4% of the products that Amazon has to offer. Walmart enjoyed its best quarter in 4 years with its 2016 Q4 earnings hitting $133.6 billion. In e-commerce, they had a 29% increase boosted by the acquisition of Jet.com, […]

How to scrape Yelp Business Details using Python and LXML

How to scrape Yelp Business Details using Python and LXML

This tutorial is a follow-up of How to scrape Yelp.com for Business Listings using Python. In this tutorial, we will help you in scraping Yelp.com data from the detail page of a business. You can use URLs of businesses you are interested in OR the ones you got from part one of this tutorial. Let’s […]

Number of Products sold on Amazon.com- February 2017

Number of Products sold on Amazon.com- February 2017

Amazon has a total of 362,160,574 products as on February 4th, 2017 That’s 5% less than January 2017. Amazon had 398 Million products on January 4, 2017. Top 10 Categories The Digital Music category is the largest, with nearly 65.6 Million products. The Digital Music category has surpassed Electronics by 14 million products compared to last […]

Data Extraction Services – an essential guide and checklist

Data Extraction Services – an essential guide and checklist

Data is everywhere but most of it is unusable because it is not in a format that can be used. Data extraction services help tap the vast data resources available online or within internal sources and extract the data so that it can be used to benefit the business. This post is a data extraction services […]

Number of Products sold on Amazon vs Walmart- January 2017

Number of Products sold on Amazon vs Walmart- January 2017

As of January 2017, Walmart has a total of 16,859,211 products on sale compared to Amazon.com’s 356,901,798 products. It’s clear that Amazon is still miles ahead of Walmart in the eCommerce space. Walmart has only 4.7% of the products that Amazon has to offer. Walmart vs Amazon (Products for sale) Here is a comparison of […]

The best data and file formats for scraped data

The best data and file formats for scraped data

The data we provide comes in various forms from the source and is largely text (barring rich media such as images and videos or proprietary file formats such as PDFs). Our customers need this data in various formats and the key to a successful and scalable solution that fits the best data formats for web […]

How many products are sold on Amazon.com – January 2017 Report

How many products are sold on Amazon.com – January 2017 Report

Amazon.com has a total of 398,040,250 Products as on January 4, 2017. That is 8% more products than the previous month. Amazon had 368 Million products on December 1 2016. Top 10 categories Below is a bar chart of the Top 10 Categories with the most number of products on Amazon.com. The Electronics department is HUGE, […]

Top 10 US destinations New Year’s Eve Hotel Price spikes

Top 10 US destinations New Year’s Eve Hotel Price spikes

Ever thought about watching the ball drop in Times Square on New Year’s eve in New York City? Well you’re not alone – a lot of people are thinking the same way and unless you have a thousand dollars for a hotel, it will just stay a dream. We analyzed average hotel prices in the […]

Turn the Internet into meaningful, structured and usable data