Document Type : Articles

Authors

1 University of Missouri, Columbia, United States

2 University of South Florida, United States

3 Southeast University, China

Abstract

As Google Analytics becomes increasingly popular, more detailed records of users’ behaviors can be captured and analyzed to better understand the performance of websites. However, current Google Analytics related research usually draws conclusions from rough estimation based on the observation of the dashboard or other basic statistical processing of the data. This study aims to provide a more accurate and informative analysis from both temporal and geospatial perspectives via clustering and GIS application. The results obtained from a resource website case study demonstrate that the proposed method is able to help web managers better examine the temporal effect on users’ visiting patterns based on accurate mathematical computation as well as provides more geographical insight into website performance through the constructed density measure and 3D graphic presentation. By offering in-depth quantitative information relying on mining data from web logs, such a study can help web stakeholders make better decisions on how to maintain and improve the websites, especially adjusting resources by considering temporal fluctuations and inequity in geographical distribution.

  1. Beitzel, S.M., Jensen, E.C., Chowdhury, A., Frieder, O. & Grossman, D. (2007). Temporalanalysis of a very large topically categorized web query log, Journal of the American Society for Information Science and Technology, 58(2), 166-78.
  2. Chi, Y., Zhu, S., Song, X., Tatemura, J. & Tseng, B.L. (2007). Structural and temporal analysisof the blogosphere through community factorization, Proceedings of the ACM SIGKDDInternational Conference on Knowledge Discovery and Data Mining. Available at: http://portal.acm.org/citation.cfm?doid=1281192.1281213
  3. Clifton, B. (2012). Advanced web metrics with Google Analytics. John Wiley & Sons.
  4. Duda, R.O., Hart, P.E., &Stork, D.G. (2001). Pattern Classification, 2nd ed. John Wiley& Sons, Inc., New York.
  5. Everitt, B.S., Landau, S., Leese, M., &Stahl, D. (2011). Cluster Analysis, 5th ed. John Wiley & Sons, Ltd., 2011.
  6. Fang, W. (2007).Using Google Analytics for improving library website content and design: A case study. Library Philosophy and Practice, 9(3), 1–17.
  7. Farney, T. &Mchale, N. (2013). Data Viewing and Sharing: Utilizing Your Data to the Fullest. Library Technology Reports49(4), 39-42. American Library Association.
  8. Guo, R., & Zhang, Y. (2013). Identifying Time-of-Day Breakpoints Based on Non-intrusive Data Collection Platforms. Journal of Intelligent Transportation Systems.
  9. Hess, M. R. (2012). Web Analytics: Using Evidence for Improvement-Über analytics: Customizing Google Analytics to track multiple library platforms.
  10. Jansen, B. J., & Spink, A. (2005). How are we searching the World Wide Web? A comparison of nine search engine transaction logs. Information Processing & Management, 42(1),248–263.
  11. Jones, C., Giersch, S., Sumner, T., Wright, M., Coleman, A. &Bartolo, L. (2004). Developing aweb analytics strategy for the National Science Digital Library, D-Lib Magazine, 10(10). Available at: www.dlib.org/dlib/october04/coleman/10coleman.html.
  12. Kent, M. L., Carr, B. J., Husted, R. A., & Pop, R. A. (2011). Learning web analytics: A tool for strategic communication. Public Relations Review, 37(5), 536-543.
  13. Khoo, M., Pagano, J., Washington, A. L., Recker, M., Palmer, B., & Donahue, R. A. (2008). Using web metrics to analyze digital libraries. InProceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries (pp. 375-384). ACM.
  14. Kirk, M., Morgan, R., Tonkin, E., McDonald, K., &Skirton, H. (2012). An objective approach to evaluating an internet-delivered genetics education resource developed for nurses: using Google Analytics™ to monitor global visitor engagement. Journal of Research in Nursing, 17(6), 557-579.
  15. Kumar, C., Norris, J. B., & Sun, Y. (2009). Location and time do matter: A long tail study of website requests. Decision Support Systems, 47(4), 500-507.
  16. Marek, K. (2011). Chapter 2: Getting to Know Web Analytics. Library Technology Reports, 47(5), 11-16.
  17. Pakkala, H., Presser, K., & Christensen, T. (2012). Using Google Analytics to measure visitor statistics: The case of food composition websites. International Journal of Information Management, 32(6), 504-512.
  18. Patton, A. J., & Kaminski, J. E. (2010). Tracking the impact of your web-based content. Journal of extension, 48(4), 4TOT1.
  19. Petersen, M. G., Iversen, O. S., Krogh, P. G., &Ludvigsen, M. (2004). Aesthetic Interaction: a pragmatist's aesthetics of interactive systems. In Proceedings of the 5th conference on Designing interactive systems: processes, practices, methods, and techniques (pp. 269-276). ACM.
  20. Plaza, B. (2009). Monitoring web traffic source effectiveness with Google Analytics: An experiment with time series. In Aslib Proceedings 61(5), pp. 474-482). Emerald Group Publishing Limited.
  21. Plaza, B. (2011). Google Analytics for measuring website performance. Tourism Management, 32(3), 477-481.
  22. Reber, R., Schwarz, N., &Winkielman, P. (2004). Processing fluency and aesthetic pleasure: is beauty in the perceiver's processing experience? Personality and social psychology review, 8(4), 364-382.
  23. Recker, M., Xu, B., Hsi, S., & Garrard, C. (2010). Where in the World? Demographic Patterns in Access Data. In EDM (pp. 337-338).
  24. Rousseeuw, P.J. (1987). Silhouettes: A Graphical Aid to the Interpretation and Validation of Cluster Analysis. Journal of Computational and Applied Mathematics20(1), 53–65.
  25. Sen, A., Dacin, P.A. and Pattichis, C. (2006), “Current trends in web data analysis”, Communication of the ACM, 49(11), 85-91.
  26. Tibshirani, R., Walther, G., and Hastie, T. (2001). Estimating the Number of Clusters in a Data Set via the Gap Statistic. Journal of Royal Statistical Society, B63, Part 2, 411–423.
  27. Tractinsky, N. (1997). Aesthetics and apparent usability: empirically assessing cultural and methodological issues. In Proceedings of the ACM SIGCHI Conference on Human factors in computing systems (115-122). ACM.
  28. Tractinsky, N. (2013): Visual Aesthetics. In: Soegaard, Mads and Dam, RikkeFriis (eds.). The Encyclopedia of Human-Computer Interaction, 2nd Ed. Aarhus, Denmark: The Interaction Design Foundation. http://www.interaction-design.org/encyclopedia/visual_ aesthetics.html.
  29. Turner, S. J. (2010). Website statistics 2.0: Using Google Analytics to measure library website effectiveness. Technical Services Quarterly,27(3),261-278.
  30. Wang, F.K. &Wedman, J.F. (2001) eThemes: An Internet instructional resource service. Information Technology and Libraries. 20(4), 179-184.
  31. Wang, X., Shen, D., Chen, H. L., &Wedman, L. (2011). Applying web analytics in a K-12 resource inventory. The Electronic Library, 29(1), 20-35.
  32. Xu, B., Recker, M., &Hsi, S. (2010). The data deluge: Opportunities for research in educational digital libraries. Internet Issues: Blogging, the Digital Divide and Digital Libraries. Nova Science Pub Inc., New York.
  33. Zhang, Y., Jansen, B. J., & Spink, A. (2009). Time series analysis of a Web search engine transaction log. Information Processing & Management, 45(2), 230-245.