All Articles

Analyzing Top Online Brands in the Apparel Industry

To gain insight into the top online brands among fashion and clothing stores, we analyzed about 45,000 products from some of the top brands with an online store: Nike, Ralph Lauren, Gap (Old Navy, Banana Republic) and Levi Strauss. Top Online Brands – Gender vs. Number of Products: Looking at the number of […]

Number of Products Sold on Amazon.com – June 2017

Amazon has a total of 372 million products as of June 2017, an increase of 1.1%. The Clothing, Shoes & Jewelry category dominates Amazon’s product count, and Alexa Skills grew by 19.1%. Check out our post to read more about Amazon’s category counts.

Amazon vs Walmart – Products Sold in May 2017

As of May 2017, Walmart has a total of 26.1 million products, while Amazon has 367 million products up for sale. Walmart’s count is only 7.11% of what Amazon has to offer. Here is a comparison of the number of products on sale at Amazon and Walmart. Amazon Keeps Edge: Amazon is now worth two […]
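
The 7.11% figure follows directly from the two product counts quoted in the excerpt. A minimal sketch of that arithmetic, where the function name `catalog_share` is chosen here purely for illustration:

```python
def catalog_share(smaller: float, larger: float) -> float:
    """Return `smaller` as a percentage of `larger`, rounded to two decimals."""
    return round(smaller / larger * 100, 2)


# Product counts in millions, as quoted for May 2017.
walmart_products = 26.1
amazon_products = 367

share = catalog_share(walmart_products, amazon_products)
print(f"Walmart lists {share}% of Amazon's product count")
```

The same helper can be applied to the counts in the other monthly comparisons.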

Dark Data, Apple and HBO

Apple recently acquired Lattice Data for $200 million. Lattice Data specializes in trying to make sense of “Dark Data”, and Apple thought this space was valuable enough to spend $200 million on. Apple has picked up Lattice Data, a company that applies an AI-enabled inference engine to take unstructured, “dark” data and turn it into structured (and […]

Number of Products Sold on Amazon.com – May 2017

Amazon.com had a total of 368,280,021 products on sale on May 4, 2017. That’s 9.3% more than in April 2017. Amazon had 335 million products on April 4th, 2017. Top 10 Categories: Digital Music leads the top 10 categories, with the Home & Kitchen and Electronics departments following behind. Below are the subcategories of Digital […]

Sports Data – The Rise of Big Data and Analytics

The global sports market is huge, with total revenue projected to be around 90 billion dollars in 2017. Sports is one of the sectors in which big data, web scraping and data analytics have demonstrated great value and great potential, with major professional sports teams putting them to use. It is […]

Amazon vs Walmart – Products Sold in April 2017

As of April 2017, Walmart has a total of 23.5 million products on sale while Amazon has 332 million products. Walmart has only 7% of what Amazon has to offer. Walmart is the second-largest online mass merchandiser behind Amazon, but it’s only a distant second. With $136 billion in total sales last year, Amazon has […]

How to Solve Simple Captchas using Python Tesseract

CAPTCHA stands for Completely Automated Public Turing test to tell Computers and Humans Apart. As the acronym suggests, it is a test used to determine whether the user is human or not. A typical captcha consists of distorted text, which a computer program cannot interpret but a human can (hopefully) still read. This tutorial will […]
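
As a rough illustration of the idea (not the tutorial's own code), a simple captcha can sometimes be read by binarizing the image and passing it to Tesseract. The sketch below assumes the Pillow and pytesseract packages and the Tesseract binary are installed; the function names, the threshold of 140, and the `--psm 8` setting are illustrative choices, not tuned values.

```python
import re


def clean_ocr_text(raw: str) -> str:
    """Keep only letters and digits from raw OCR output."""
    return re.sub(r"[^A-Za-z0-9]", "", raw)


def solve_simple_captcha(image_path: str, threshold: int = 140) -> str:
    """Binarize a captcha image and run Tesseract on it.

    Assumes Pillow, pytesseract and the Tesseract binary are installed.
    The threshold is an illustrative starting point, not a tuned value.
    """
    # Imported lazily so clean_ocr_text() works without the OCR stack.
    from PIL import Image
    import pytesseract

    image = Image.open(image_path).convert("L")  # convert to grayscale
    # Pixels brighter than the threshold become white, the rest black.
    binary = image.point(lambda p: 255 if p > threshold else 0)
    # --psm 8 asks Tesseract to treat the image as a single word.
    raw = pytesseract.image_to_string(binary, config="--psm 8")
    return clean_ocr_text(raw)
```

Calling `solve_simple_captcha("captcha.png")` (a hypothetical file path) returns whatever alphanumeric text Tesseract recognizes; heavily distorted captchas will likely need more preprocessing than this.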

Number of Products Sold on Amazon.com – April 2017

Amazon.com had a total of 335,765,099 products as of April 4th, 2017. That is only 1% less than in March 2017. Amazon had 337 million products on March 7th, 2017. Top 10 Categories: Over the past 3 months, Digital Music has been dominating the top 10 categories with a total of 67.1 million products, a slight growth […]

How to Parse Addresses using Python and Google GeoCoding API

Web scraping often leaves you with unstructured address data. If you have come across a large number of freeform addresses, each stored as a single string, for example “9 Downing St Westminster London SW1A, UK”, you know how hard it is to validate, compare and deduplicate them. To start […]
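
One way to approach this, sketched under stated assumptions rather than taken from the tutorial, is to send each freeform string to the Geocoding API's JSON endpoint and flatten the `address_components` list of the first result into a dictionary keyed by component type. The helper names below are hypothetical, a valid API key is assumed, and the sample result is hand-written in the shape of the documented response rather than real API output.

```python
import json
from typing import Optional
from urllib.parse import urlencode
from urllib.request import urlopen

GEOCODE_URL = "https://maps.googleapis.com/maps/api/geocode/json"


def build_geocode_url(address: str, api_key: str) -> str:
    """Build the request URL for one freeform address string."""
    return f"{GEOCODE_URL}?{urlencode({'address': address, 'key': api_key})}"


def flatten_components(result: dict) -> dict:
    """Map each component type to its long name for one geocoding result."""
    parsed = {"formatted_address": result.get("formatted_address")}
    for component in result.get("address_components", []):
        for kind in component.get("types", []):
            # Keep the first value seen for each component type.
            parsed.setdefault(kind, component.get("long_name"))
    return parsed


def geocode(address: str, api_key: str) -> Optional[dict]:
    """Call the API and return the flattened first result, if any."""
    with urlopen(build_geocode_url(address, api_key), timeout=10) as response:
        payload = json.load(response)
    results = payload.get("results", [])
    return flatten_components(results[0]) if results else None


# Hand-written sample in the documented response shape (not real API output).
sample_result = {
    "formatted_address": "9 Downing St, London SW1A, UK",
    "address_components": [
        {"long_name": "9", "short_name": "9", "types": ["street_number"]},
        {"long_name": "Downing Street", "short_name": "Downing St", "types": ["route"]},
        {"long_name": "London", "short_name": "London", "types": ["postal_town"]},
        {"long_name": "United Kingdom", "short_name": "GB", "types": ["country", "political"]},
    ],
}
parsed_sample = flatten_components(sample_result)
```

With structured fields such as `route` and `postal_town` in hand, comparing and deduplicating addresses becomes a matter of comparing dictionary values instead of raw strings.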

How to Scrape Expedia using Python and LXML

Learn how to scrape flight details from Expedia.com, a leading travel and hotel site, using Python 3 and LXML in this web scraping tutorial. You’ll learn how to extract flight details such as flight timings, plane names, flight duration and more for a given source and destination.

Amazon vs Walmart – Products Sold in March 2017

As of March 2017, Walmart has a total of 22 million products on sale and Amazon has 375 million products. Walmart has only 5.8% of what Amazon has to offer. Competition with Amazon: Walmart has seen a quantum leap, with e-commerce sales picking up. With Walmart’s earnings in the fourth quarter seeming to be a definite blow to Amazon, […]

Turn the Internet into meaningful, structured and usable data