`
Etsy, Inc. an American e-commerce company focused on handmade or vintage items and craft supplies. These items fall under a wide range of categories, including jewelry, bags, clothing, home décor and furniture, toys, art, as well as craft supplies and tool. There are millions of transactions being generated on the entire platoform. We are interested to get some understanding about how many sales are genrated over some fixed tiem intervals and who are the top performing shops in temrs of sales volume. We are also interested in which shops are are the most liked or followed shops, and how many customer reviews they receive.
However, to address the above questions is definitely not an easy task. Luckily, with our modern AI based web mining technology, this is no longer something undoable. Our AI based web mining technology is a technology that combines web page analysis, web data crawling or scraping , content extraction, raw data parsing and cleanning etc, the entire data collection process is executed by our AI powered intelligent system.
As already briefly described here – https://barkingdata.com/?etsy-top-shops/ , Now let’s take a more closer look at what and how we can track and extract from the shops. First, let’s open a random shop’s profile page, please note that this page is a public page and can be accessed by any ordinary PC desktop or mobile device user.
Let’s scrape some public info about the shop such as total sales, total likes, review count, ratings etc. and store these data atrributes into a relational database such as MySQL/Sybase/Oracle etc, then repeate the scrape for the other shops as well! Since the purpose of this blog is to demonstrate what we can do with the data, we are not going to talk about too much about the data collection process.
After a week or two, we come back to extract the data again and we will have two rounds of data in the database. Now lets’ start the data processing part. First, we use internal tools to clean up the raw data a little bit to make it better quality for the subsequent analytics part. After cleaning up, we then use some python script to calculate the sales volume difference between the two rounds of raw data. Lastly, we use business intelligence and data visualization tools to apply the final analysis and aggreation to the dataset and plot some charts.
The chart below gives a clear picture of the top selling shops between the two rounds of data collection.
Also in the raw data we can collect the shop create date for each shop. We can also easily make a plot for shops create date.
We can get a pretty clear picture of how new shops get created on the platform over time. From the chart above , it telsl us that in 2020 when Covid started, the number of new shops also went up quite a bit during the year of the pandemic started.
Similarly, we can also plot more charts for other metrics. I shared some top selling shops to download for free, please download from here:
https://www.kaggle.com/datasets/polartech/1m-etsy-shops-sales-data
Indestry's Lowest Priced Google SERP API Service, Scrape Google SERP Anonymously and Consistently
Web Scrape Google Flights Data to Get Real Time Airline TIcket Pricings and Flights Schedules
Web Cralwer to Extract Product and Category Data from Top Fashion Website Nordstrom.com
Web cralwers to harvest food delivery data from Ubereats, doordsash, grubhub ...
Web Crawlwers to scrape homedepot.com for product listings and product details data
Web crawlers to scrape China hotels data from top hotel websites such as holidayInn, Ctrip etc.
Grab Holdings Inc., commonly known as Grab, is a Southeast Asian technology company headquartered in Singapore and Indonesia. In addition to transportation, the company offers food delivery and digital payments services via a mobile app. Grab currently operates in Singapore, Malaysia, Cambodia, Indo
Collect millions of realestate data from Thailand major realEstate website ddproperty.com
Web crawlers to scrape lazada for product listings data and category data
Web Crawlers to Scrape Global Interste Rate, Mortgage Rate, Deposit Rate
One of the industry's best Web Crawlers(Service) for China Major Ecommerce Websites such as Tmall, JD, Kaola, PinDuoDuo etc.