\
ISSN: 2455-5479
##### Archives of Community Medicine and Public Health
Research Article       Open Access      Peer-Reviewed

# Data analysis of online shopping platform during the COVID-19 epidemic of Coronavirus disease

### Bin Zhao1* and Jinming Cao2

1School of Science, Hubei University of Technology, Wuhan, Hubei, China
2School of Information and Mathematics, Yangtze University, Jingzhou, Hubei, China
*Corresponding author: Dr. Bin Zhao, School of Science, Hubei University of Technology, Wuhan, Hubei, China, Tel/Fax: +86 130 2851 7572; E-mail: zhaobin835@nwsuaf.edu.cn
Received: 31 Octomber, 2020 | Accepted: 25 January, 2021 | Published: 28 January, 2021
Keywords: Data analysis; BP neural network; Fuzzy evaluation; Index system

Cite this as

Zhao B, Cao J (2021) Data analysis of online shopping platform during the COVID-19 epidemic of Coronavirus disease. Arch Community Med Public Health 7(1): 023-032. DOI: 10.17352/2455-5479.000229

Data analysis of online shopping platform, and With the development of online platform, more and more consumers will choose this convenient way of online shopping. This paper uses Spyder’s time-based model to mine the rating data of Amazon online shopping platform, establishes a neural network model, analyzes the connection relationship of each rating index, carries out descriptive statistical analysis on each index, obtains the correlation results between the impact indicators, and carries out fuzzy evaluation, analyzes the impact of each evaluation index on the product, and finally combines the product’s The relationship between sales situation and rating provides reliable product sales design mode for Amazon, and gives some sales suggestions, so as to enhance the product’s desirability.

### Introduction

With the improvement of people’s material life quality, more and more consumers have been favored high quality and high praise items, and become the first choice of people’s daily shopping. After purchasing, consumers grade the products through star rating and experience comments on the shopping website, and select high-quality products. The managers of shopping websites can understand the shortcomings of their products in other websites of the same type through consumer reviews, so as to grasp the advantages and disadvantages of product sales and clarify the development direction of products. For consumers, the evaluation results can let them understand the specific ranking and advantages of different products of the same type, match with their own needs, so as to obtain better shopping. After purchasing the products, consumers also participate in the evaluation process of the goods, and as a group with the most say in the evaluation, they express their choices and wishes, so as to promote the businesses to find themselves. As a result, consumers get a better shopping experience. These two groups are opposite to each other and promote each other, forming a virtuous circle.

With the improvement of people’s material life quality, more and more consumers have been favored high quality and high praise items, and become the first choice of people’s daily shopping. After purchasing, consumers grade the products through star rating and experience comments on the shopping website, and select high-quality products. The managers of shopping websites can understand the shortcomings of their products in other websites of the same type through consumer reviews, so as to grasp the advantages and disadvantages of product sales and clarify the development direction of products. For consumers, the evaluation results can let them understand the specific ranking and advantages of different products of the same type, match with their own needs, so as to obtain better shopping. After purchasing the products, consumers also participate in the evaluation process of the goods, and as a group with the most say in the evaluation, they express their choices and wishes, so as to promote the businesses to find themselves. As a result, consumers get a better shopping experience. These two groups are opposite to each other and promote each other, forming a virtuous circle.

##### The model of problem 1

Selection principle of indicators: The online shopping platform is complex. In order to get the product comment information scientifically and reasonably, the selection of indicators is a crucial step. Only when the indicators that can accurately measure the product are selected can the model be established [1], it should follow the following five principles should in the selection process Figure 1.

##### Principle

1) Scientific principles: the selection of measurement indicators must base on scientific principles, and can truly and objectively reflect the impact of each element on the selection of indicators.

2) Practical principle: the construction of the evaluation system is mainly theoretical analysis, which will be affected by the data sources of various indicators in practical application. Therefore, it should guarantee the availability and reliability of data sources in the process of selecting indicators.

3) The principle of system: there should be a certain logical relationship between the indicators, not a single index, but a system of product evaluation information, they should not only reflect the sales and praise of the three products from different aspects, but also form a systematic organic whole.

4) Principle of comparability: different types of index data should conform to comparability, so the evaluation index system constructed conforms to universality.

5) The principle of relevance: three products selection of evaluation index system should be the combination of a series of indicators, in each of the products under the background of stars and comments, through the analysis of each product related comments properly and evaluation, not only can evaluate the efficacy of the product of actual sales, but also can judge the trend of three kinds of product sales.

Based on the above selection principles of five evaluation indexes, figure shows the overall framework of the comprehensive evaluation index system of the market products studied in this paper Figure 2.

Selection of evaluation indexes: According to the scientific selection principle, combined with the relevant selection criteria of product evaluation body wash index [2] and the comments and sales of three products provided by sunshine company, the evaluation indexes that can better express the sales characteristics of products are selected as follows Table 1.

##### Credibility

Because credibility relates with many factors, when describing credibility, combined with the given data and conditions, the following three aspects describe the credibility: whether Amazon members are authenticated, whether products are confirmed to be purchased and whether the number of votes is useful Figure 3.

In order to further analyze the influence of credibility and determine the influence weight of credibility, three factors affecting credibility are assigned and defined. The results are as follows Table 2.

Set useful voting as Ii and useless voting as Ji and obtain the functional relationship of the three products under the selected index with respect to reliability as follows:

Where, Fh is Credibility score ; a1, a2, a3 are the undetermined coefficient.

In order to select the indexes that have the greatest impact on the product evaluation system, the above four indexes were reduced by principal component analysis. The results are as follows:

##### Star rating

The company has designed two kinds of star rating for different objects, one is store feedback, the other is list, that is, product review. Customers can rate the store and products after purchasing goods to express the satisfaction of this shopping. After customer rating, the company will take the average number for the star rating given. Since the stars shown are the average number, half or most of the stars will appear.

The sales of the product will be judged by the star rating. The evaluation grade is 1-5 stars from low to high, which is directly converted into a judgment index. Finally, the sales situation of the products is determined by calculating the average stars. The evaluation grade is 1-5 stars from low to high, which is directly converted into a judgment index. Finally, the sales situation of the products is determined by calculating the average stars. Indicators are classified as follows Table 3.

The calculation method of star rating is as follows:

Where Fs is star rating; θI is the Number of stars ; βi is star index.

The comment Title often determines whether consumers are willing to continue reading and browsing the comment, so the importance of the title is obvious. When evaluating the comment title, we can consider the length of the text, the number of commodity characteristic words, the number of negative emotional words, etc.

In order to judge the evaluation index formed by the evaluation title more accurately, we will use the text length, the number of characteristic words of goods, and the respective weight of the number of negative emotional words to describe the evaluation title.

Let the weight of text length (L) be W1, the weight of characteristic word quantity (N) of commodity be W2, and the weight of negative emotion word quantity (M) be W3, respectively. Syder software is used to screen the overall reviews of the three products, and the descriptive statistics of text length, the number of characteristic words of goods and the number of negative emotion words are as follows Tables 4-6.

Through statistical analysis of three product indexes, we can get

Where a, b and C are the weight of text length, quantity of commodity characteristics and quantity of negative emotion words.

Model building: According to the three evaluation indexes selected above, a BP neural network model is constructed, and the process of establishing the model is shown in Figure 4.

##### Parameter setting of BP neural network model

1) Network layer number: Kolmogorov theorem [3-5] points out that in theory, three- layer neural network can fit any continuous nonlinear function. In order to simplify the model, this paper uses three-layer neural network model.

2) Set input layer: Four evaluation criteria are selected to describe the product, so the number of input layer neurons is 4.

3) Number of neurons in the hidden layer: There is no fixed algorithm for calculating the number of neurons in the hidden layer of the model, and the number is closely related to the number of input layer and output layer, which needs to be determined by experience and multiple tests. The number of neurons in the hidden layer is 4, so the number of neurons in the hidden layer is 4.

4) Output layer setting: The output result of the shopping evaluation model in this paper has only one comprehensive score about the product, so the output layer setting has only one neuron [5-8].

##### BP The solution of neural network evaluation model

Step 1: The connection weights between neurons in each layer of network initialization vij, wjk, each weight value is assigned an interval random number in (-1,1), given calculation accuracy ε and maximum learning times M, give hidden layer threshold aj and output layer thresholds bk.

Step 2: Input sample $X=\left({x}_{1},\dots ,{x}_{n}\right)$ and the corresponding expected output.$D=\left({d}_{1},\dots ,{d}_{n}\right)$

Step 3: Hidden layer output calculation. According to the input vector X, connection weight between input layer and hidden layer Vjk, and hidden layer threshold a, calculate hidden layer output.

Where m is number of hidden layer nodes vi0 = -1,x0 = aj, f (.) is the implicit layer transfer function.

Step 4: Output layer output calculation. Output y according to the hidden layer, Connection weight wjk and threshold value b,Calculating the actual output O of BP neural network.

Step 5: Error calculation. According to the actual output O and expected output D of the network, the overall error E of the network is calculated.

Step 6: Weight update. According to the overall network error E, update the network connection weight according to the following formula wjk,vjk.

$\Delta {v}_{ij}=\eta \left(\sum _{k=1}^{l}{\delta }_{k}^{0}{w}_{jk}\right){y}_{j}\left(1-{y}_{j}\right){x}_{i}$

$\Delta {w}_{jk}=\eta {\delta }_{k}^{0}{y}_{j}$

Where

${\delta }_{k}^{0}=\left({d}_{k}-{O}_{k}\right){O}_{k}\left(1-{O}_{k}\right)$

, in styleis the learning rate.

Step 7: Training and convergence. When the average error of the calculated training sample is less than ε, the whole training is over, otherwise, the above process is repeated, and the weight and threshold are constantly modified. After repeated calculation, the actual output of the network gradually approaches to the corresponding desired output, which is also the process of the global error of the network tending to the minimum. After repeated iterations, when the error is less than the allowable value, the training process of the network ends.

##### Conclusion of question 1

The principal component analysis is carried out to determine whether Amazon members have been certified, whether purchasing power products and voting numbers have been confirmed to be useful, and the evaluation index of credibility is obtained; the evaluation grade is taken as the second evaluation index, and the text length, the number of commodity characteristic words and the number of negative emotional words are mined with Spyder data, and the corresponding values are obtained after descriptive statistical analysis. And get the third evaluation index of the comment title. Then the evaluation model of three product evaluation indexes is established by using BP neural network.

##### The model of problem 2

In order to get the data measure that can best be tracked by the sunshine company from rating and comment, we choose star rating, helpful votes, total votes, and evaluation score as variables to establish four evaluation indexes. We use the fuzzy evaluation theory to discuss these indexes, and finally give their comprehensive impact to determine their final data measure, The specific flow chart of the fuzzy evaluation theoretical model is shown in Figure 5.

##### Establishment of model

Establishment of model

1) Set the factor set U = {µ1234} as the influencing factors of four evaluation indexes to the comprehensive indexes. Where, µ1: refers to the data of star rating; µ2: refers to the data of helpful vote; µ3: refers to the number of comments; µ4: refers to the evaluation score.

2) Select data. Select random 10 comments of purchased goods in each table, set the evaluation set V = {v1,v2,v3, …v45} to represent 30 comments, and calculate the corresponding evaluation degree of each comment through the model of question one.

3) Establish single factor evaluation matrix:

${r}_{ij}=\frac{{x}_{ij}}{\sum _{j=1}^{45}{x}_{ij}}\left(i=1,2,3,4;j=1,2,\dots 45\right)$

4) Single factor weight

Set the weight of each evaluation factor λ as:

λ= [λ1,λ2,λ3,λ4]

5) The choice of different models in fuzzy theory:

In the fuzzy evaluation of body shape, different principles correspond to the selection of different models:

Solution one: M (∨,∧) (Main determinant)

${b}_{j}=\underset{i=1}{\overset{4}{\vee }}\left({a}_{i}\wedge {r}_{ij}\right)$

Solution two: M (•,∧) (Main factor prominent type)

${b}_{j}=\underset{i=1}{\overset{4}{\vee }}\left({a}_{i}•{r}_{ij}\right)$

Solution three:M(∧⊕) (Main factor prominent type)

${b}_{j}=\underset{i=1}{\overset{4}{\vee }}\left({a}_{i}\wedge {r}_{ij}\right)$

Solution four: M (•,+) (Weighted average model)

${b}_{j}=\underset{i=1}{\overset{4}{\vee }}\left({a}_{i}•{r}_{ij}\right)$

According to this problem, we use a more suitable weighted average model, that is, solution 4.

1. Define the weight coefficient. It can be seen from the reality that the comprehensive evaluation bj will be positively correlated with the three indicators of star rating, helpful votes and evaluation score, and the user will choose to watch helpful votes and comments. Total votes includes helpful votes and some voting data with negative correlation. Therefore, the definition λ= [10,1,-0.5,0.05].

Comprehensive evaluation:.

${b}_{j}=\sum _{i=1}^{4}\left({\lambda }_{i}•{r}_{ij}\right)$

Solution: first normalize the sample data, then normalize the whole data, and then substitute the obtained value into the formula to get the compreheensive data value bj.

##### Solution of model

Make correlation analysis between the comprehensive index data of sample data bj and the evaluation degree of sample data, and the results are shown in Table 7.

Conclusion: The analysis shows that the correlation between the two models is basically the same, which confirms the accuracy of the neural network model of question one and this model. Through this model, we can accurately provide data measurement based on rating and comment for sunshine company, and sunshine company can analyze the market of goods according to these measurement.

##### The model of problem b

Model establishment and solution: This model adds time measurement mode, and establishes time rating model by using the evaluation reliability index discussed in question 1.

Because the recognition and discussion of three product data sets based on time measurement and pattern are similar, only the blower is discussed in detail, but the data recognition process is similar, so the time rating model of the blower is analyzed carefully, which is rough in the analysis of microwave oven and pacifier, but also gives the analysis results in detail and clearly.

##### Establish time rating model for hair dryer

According to the evaluation grade, evaluation title and evaluation equation of question 1:

Evaluation level:

${F}_{s}=\frac{\sum _{i=1}^{5}{\theta }_{i}{\beta }_{i}}{\sum _{i=1}^{5}{\theta }_{i}}$

Evaluation title:

$\gamma =aL+bN+cM$

Known by question 1:

$\gamma =0.356L+0.218N+0.435M$

Through the time series analysis and prediction of SPSS, we can respectively get the observation chart of the star change trend of the blower based on the time measurement as shown in Figure 6, and the observation chart of the comprehensive change trend based on the evaluation score and star level under the time measurement as shown in Figure 7, as well as their influence chart, namely the overall change trend chart, as shown in Figure 6 and Figure 7.

According to the consumer’s star level change and the overall change trend chart, we can know that the star level is on the rise. From 2013 to 2014, the rating rose rapidly, but it was also in a rapid decline stage in the same year, but the overall trend was still on the rise. In other words, the higher the star rating of consumers is, the higher the value is, the greater the reputation of products will be, and the greater the impact of consumers’ purchase decisions will be.

In order to improve the analysis and the reliability of the analysis results, in view of this problem, the evaluation star level evaluation score is also considered comprehensively. After the correlation analysis, it is considered that the evaluation star level and the evaluation score are related to a certain extent, so under the time measurement, the change trend is observed, and then the star level change under the time measurement is analyzed separately. Trends are compared for reliability of results. After the software analysis, we can get the change trend and the overall trend as shown in the Figures 8,9.

According to the evaluation score of the hair dryer by consumers and the comprehensive trend chart of star level changes, it can be seen that the comprehensive change shows a downward trend at the end of 2015, but on the whole, it is still an upward trend. It can be seen from the figure that the evaluation of hairdryer by consumers reached the peak of evaluation score and star rating in 2015, and then declined. This trend is similar to the change trend of star rating and evaluation score from 2011 to 2012, so it is not ruled out that the problem of product quality and consumers’ evaluation psychology of purchasing goods. In short, when analyzing the comprehensive evaluation trend of evaluation score and star rating, and analyzing the comprehensive indicators of star rating and evaluation score, it can be concluded that the reputation of products will decline briefly after 2016, and then increase rapidly, but its reputation is increasing in the online market in 2015 Figure 10.

Because the time series analysis method of evaluation and rating of microwave oven, pacifier and blower is similar, there is no detailed explanation when analyzing microwave oven and pacifier.

##### Establish time rating model for microwave oven

Known by question 1:

The time-based measurement and pattern are identified in the data set of microwave ovens. Because the comprehensive index of star rating and evaluation score can better reflect the increase and decrease of product reputation in the online market when the data set of hair dryer is analyzed and discussed, the comprehensive index of star rating and evaluation score is directly considered in the analysis of microwave ovens, and the star to product is not considered separately Influence.

According to the data set of microwave oven, after preprocessing the missing value and time series, we can get the comprehensive trend chart of evaluation score and star level based on the measurement of time series as shown in Figure 11.

According to the comprehensive trend chart of microwave oven evaluation score and star rating based on time series measurement, the reputation of microwave oven is slowly decreasing in the online market at this stage.

##### Time rating model for pacifier

Known by question 1 :

$\gamma =0.188L+0.265N+0.547M$

According to the data set of the pacifier, after preprocessing the missing value and time series, we can get the comprehensive trend chart of evaluation score and star level based on the measurement of time series as shown in Figure 12.

According to the comprehensive trend chart of evaluation score and star rating of nipple based on time series measurement, the reputation of nipple is slowly increasing in the online market at this stage.

##### Conclusion of question b

Based on the above analysis, it can be concluded that the reputation of hair dryer is increasing in the online market, the reputation of microwave oven is slowly decreasing, and the reputation of pacifier is slowly increasing.

##### The model of problem c

Analysis of model: In order to better analysis of the product in a potential success and potential failure, we choose the most can reflect real product quality indicators star-rating, evaluation score, number of comments, the five-star rating proportion. See each item as a high- dimensional space of points, each evaluation index represents the dimension on this point, using the comprehensive evaluation method of fuzzy theory, the commodity properties of fuzzy similarity to high point, construct the fuzzy clustering model.

##### Establishment of model

Data selection: In order to avoid the volatility of evaluation indexes caused by too few data and ensure that the number of comments on each model is more than 20, we randomly select 20 products from three categories as samples and take the average value of sample indexes for analysis.

A total of 20 commodities are input into a two-dimensional matrix with respect to three variables, which is called the observation matrix:

The column vector Wi = (i=1,2,…20) represents the value of the evaluation variable corresponding to each commodity, and the row vector Yr(r = 1,2,3) represents the value of the sample data with respect to a certain evaluation variable.

##### Steps of fuzzy clustering model

Step 1: Establish observation data matrix W for sample commodities;

Step 2: The data matrix of the sample is standardized to unify the data structure. In this paper, the standard deviation change method is used to process:

${{W}^{\prime }}_{i}=\frac{{W}_{i-{W}_{ave}}}{\sigma }$

among them, Wave is the average value of four evaluation indexes of sample data, σ is the standard deviation.

Step 3: There are many ways to calculate the relative distance of commodity between points in matrix space, such as Mahalanobis distance, absolute distance, Euclidean distance, etc.This paper adopts the most simple and practical European distance:

${d}_{ns}\left(2\right)={\left[\sum _{i=1}^{M}{\left({Z}_{ni}-{Z}_{si}\right)}^{2}\right]}^{\frac{1}{2}}$

Thus

${d}_{ns}^{2}=\left({W}_{n}-{W}_{s}\right)\left({W}_{n}-{W}_{s}{\right)}^{\prime }$

Step 4: The average distance method and the shortest distance method can be used to calculate the distance between space classes.

Step 5: Through the sample matrix data, the above steps of fuzzy clustering model are carried out, and MATLAB is used to solve the problem, then the specific results can be obtained.

##### Solution of model

Because the scalar quantity we selected is positively related to the sales volume of the product, we select the largest data [5400,1] in the sample data as the success point Ws and calculate the result through the fuzzy clustering model as shown in the figure below Figure 13.

Take the Euclidean distance between goods d= 0.6 and d = 1.2, respectively, as the dividing line between possible successful products and failed products. At that time, d<0.6, the potential success of the product was indicated. At that time, d>0.6 the product failed.

##### The model of problem d and e

Index correlation analysis: In the sales of products in various aspects, star and review is the customer after the purchase of goods for the evaluation of the performance of the product, can objectively reflect the product quality or not. The amazon review the star rating of the basic principle is the addition of the positive and negative, and then according to the A9 algorithm weighted average, finally it is concluded that the shop star digital, has a certain scientific nature and accuracy.

In order to explore whether there is a certain influence relationship between star rating, text rating and product rating, the first question is related to mining the selected text data, and the data is statistically analyzed to see whether there is a correlation between each star rating and review text and rating. Therefore, the text types are divided as follows:

1. The division of the five star indexes is intuitively divided into five weight standards according to the division standard in question one;

2. Through the word frequency statistics in the text comment, assign value weight to the characteristic words;

3. The number of valid votes is extracted as the score index of the product and the weight standard is obtained.

Pearson correlation analysis was carried out on the divided indexes, and the correlation results among the three product index ratings were obtained as follows:

It can be seen from Tables 8-10 that the significance between the stars, reviews and scores of the three products is P= 0.00<0.05, indicating that there is a linear relationship; the correlation between the three product evaluation indexes is greater than 0.7, indicating a high correlation, and the three products are closely related.

##### Answer to problem d and e

In order to determine the specific linear relationship between stars, reviews and ratings, the further relationship between the three indicators of each product was determined and the impact was predicted Figures 14-16.

Can be seen from the diagram, three star ratings, reviews and comments, there is an obvious linear relationship between the content of the comments and ratings with the increase of commodity star, tend to be more high praise, content and score also gradually rise, for those low star products, relatively, comments are more negative content, grading is low.

After evaluation of the products is an interconnected system, after customers to buy goods on the star rating, help review and product grade, interactions between the three, the size of the star indicators will affect the customer comments on the products and subsequent rating level. Star index is higher, affect the customer comments on the product content length, the more customers will write more positive emotional words, such as good, happy, and better, to express his love for the product, that matter, at the same time can also affect the customer to product good comments score, according to the customer’s use of the products and comments are Face effects give them higher ratings.

### Conclusion

This study is mainly based on the neural network model of Amazon Product sales strategy analysis.Through the screening and analysis of each index by BP neural network, fuzzy evaluation theory and time rating model, the market heat and Prospect of the three products are judged. It can be seen that higher star index will lead to more favorable comments and higher scores, and customers will write more enthusiastic words about good, well and excellent, It reflects the sales volume of the products, and then provides the company with online sales strategy and affirms the time-based mode. The text data helps the company to interact in the way of manufacturing successful products, which can also be used for the evaluation of Shopping platform quality, diagnosis and treatment of Shopping platform-related diseases.

This work was supported by the Philosophical and Social Sciences Research Project of Hubei Education Department (19Y049), and the Staring Research Foundation for the Ph.D. of Hubei University of Technology (BSQD2019054), Hubei Province, China.

1. Kun L (2018) Analysis and Research on emotional tendency of commodity review based on expression skills. China University of Mining Technology 2018.
2. Xu X, Liu W, Gursoy D (2018) The impacts of service failure and recovery efffforts on airline cust omers’ emotionsand satisfaction. J Travel Res 58: 1034-1051. Link: https://bit.ly/36hGJmw
3. Ajzen I, Fishbein M (1977) Attitudebehavior relations: A theoretical analysis and review of empirical re search. Psychological Bulletin  84: 888-918.Link: https://bit.ly/3qSpeRI
4. Yaqi C, Tian J, Wang L (2014) Research on the relationship between online reviews and consumers' purchase intention. Jiangsu University of Science and Technology. Link: https://bit.ly/2YnzQvG
5. Shouren H (1993) Introduction to neural network. Changsha: National University of Defense Science and Technology Press 113-120.
6. Minna N, Zhihong S, Zi W, Shujia L, Jing H (2016) Application of BP neural network model for product modeling perceptual image evaluation. Journal of Donghua University (NATURAL SCIENCE EDITION) 42: 604-607.
7. Ming C (2015) Research on R & D capability evaluation of enterprises based on neural network. Ocean University of China.
8. Wanli Z (2013) Research on credit risk assessment model of commercial banks based on artificial neural network. Changsha University of Technology.
© 2021 Zhao B, et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.