`
Roblox is an online game platform and game creation system developed by Roblox Corporation that allows users to program games and play games created by other users.Roblox is free to play, with in-game purchases available through a virtual currency called Robux. As of August 2020, Roblox had over 164 million monthly active users, including more than half of all American children under 16.
In this blog our aim is to explore correlations between dataset attributes on Roblox. The main focus is on changes in the number of players across different time series, including weekly, monthly and hourly changes in the number of online players. The raw data used in this analysis is collected via our AI based web mining technology. Our AI based web mining technology is a technology that combines web page analysis, web data crawling or scraping with artifical intelligence , data parsing and cleanning etc, the entire data collection process is executed by our AI powered intelligent system. Read more about our AI based web mining technology.
After collecting the data, lets say we save the data into a csv file named roblox_popular_rank_2022-03.csv. Regarding this file, it contains a total of 17 attributes and 3389876 rows of data. Among these attributes, we pay close attention to "playerCount", "name" and "RunDate" to see how the online players count result changes throughout March and throughout the day. For the file we chose to use, its data was organized in detail, the data structure was clear, the three datasets generated had meaningful data, the columns used had no obvious missing values or errors, and only two lines contain the none value of the selected attribute, which is perfectly analyze.
Tasks
Two main analysis goals will be achieved: 1. The first task is to find a suitable proportional number that could represent 90% of the total player. 2. Extract and process data based on Task 1 to reflect how players calculate hourly and daily changes for March
Daily Plays Count Analysis
Import data and select columns to be used:
Now do some data cleanup:
Since there are multiple datasets per day, in order to ensure the validity of the data, we create Figure 1 to figure out how much data should be used for the analysis task of this part. To get this proportion result, after sorting the player count attribute from largest to smallest, calculate each part’s proportion and draw the above graph. Core code shared below:
Figure 1 shows that top 5% of player count of dataset can represent 90% of the total, so top 5% of games of this dataset would be extracted for further analysis.
For completing Task 1, we extract top 5% of the sorted dataset and plot Figure 2 as the output of Task 1:
Essentially, there is a clear cyclical change in the data from week to week. Figure 2 and Figure 3 are ploted to show the change in the number of online players in March, reflecting a similar situation--with Monday being the day with the fewest online players, while there is a boost from Friday to Sunday, with the number of players peaking on Saturday.
Hourly Player Count Analysis
Similarly, we can easily plot a chart to study the hourly player count trend as below:
It it noticeable that players prefer to play at 18:00, and 6:00 is the time when the least people play. Last but not least, there is a clear upward trend in the number of players (average and sum) in the afternoon
Now let's pick the top 10 most popular games and plot a similar chart like the above:
It is worth noting from this chart that the game called Brookhaven RP is the most popular one in March that has a one-day trend similar to the ones we have plotted earlier; and also the same pattern for games whose play count is below 100,000.
Indestry's Lowest Priced Google SERP API Service, Scrape Google SERP Anonymously and Consistently
Web Scrape Google Flights Data to Get Real Time Airline TIcket Pricings and Flights Schedules
Web Cralwer to Extract Product and Category Data from Top Fashion Website Nordstrom.com
Web cralwers to harvest food delivery data from Ubereats, doordsash, grubhub ...
Web Crawlwers to scrape homedepot.com for product listings and product details data
Web crawlers to scrape China hotels data from top hotel websites such as holidayInn, Ctrip etc.
Grab Holdings Inc., commonly known as Grab, is a Southeast Asian technology company headquartered in Singapore and Indonesia. In addition to transportation, the company offers food delivery and digital payments services via a mobile app. Grab currently operates in Singapore, Malaysia, Cambodia, Indo
Collect millions of realestate data from Thailand major realEstate website ddproperty.com
Web crawlers to scrape lazada for product listings data and category data
Web Crawlers to Scrape Global Interste Rate, Mortgage Rate, Deposit Rate
One of the industry's best Web Crawlers(Service) for China Major Ecommerce Websites such as Tmall, JD, Kaola, PinDuoDuo etc.