A Dockerized Big Data Architecture for Sports Analytics

Ozguven, Yavuz; Gönener, Utku; Eken, Süleyman

doi:10.2298/csis220118010o

Ozguven Y. M., Gönener U., Eken S.

COMPUTER SCIENCE AND INFORMATION SYSTEMS, vol. 19, no. 2, pp. 957-978, 2022 (SCI-Expanded)

Publication Type: Article / Full Article
Volume: 19, Issue: 2
Publication Date: 2022
DOI: 10.2298/csis220118010o
Journal Name: COMPUTER SCIENCE AND INFORMATION SYSTEMS
Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Applied Science & Technology Source, Computer & Applied Sciences, INSPEC, Directory of Open Access Journals
Pages: pp. 957-978
Kocaeli University Affiliated: Yes

Abstract

The big data revolution has had an impact on sports analytics as well. Many large corporations have begun to see the financial benefits of integrating sports analytics with big data. Relying on centralized processing systems to aggregate and analyze large amounts of sports data from many sources compromises the accuracy and timeliness of the data. Distributed systems address these issues, and the MapReduce paradigm holds promise for large-scale data analytics. In this paper, we describe a big data architecture based on Docker containers with Apache Spark. We evaluate the architecture on four data-intensive case studies in sports analytics: structured analysis, streaming, machine learning approaches, and graph-based analysis.
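
To make the MapReduce paradigm mentioned in the abstract concrete, the following is a minimal single-process sketch in plain Python. It is not the authors' implementation and does not use Apache Spark or Docker; the event records, team names, and function names are hypothetical and chosen only for illustration. It shows the three conceptual phases (map, shuffle, reduce) applied to counting goals per team.

```python
from collections import defaultdict
from functools import reduce

# Hypothetical match events as (team, event_type) pairs; not data from the paper.
EVENTS = [
    ("Team A", "goal"),
    ("Team B", "foul"),
    ("Team A", "goal"),
    ("Team B", "goal"),
    ("Team A", "foul"),
]


def map_phase(events):
    """Emit a (team, 1) pair for every goal event."""
    return [(team, 1) for team, kind in events if kind == "goal"]


def shuffle_phase(pairs):
    """Group emitted values by key, as a framework would between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(grouped):
    """Sum the grouped values for each key."""
    return {key: reduce(lambda a, b: a + b, values) for key, values in grouped.items()}


def goals_per_team(events):
    """Run the three phases in sequence on an in-memory list of events."""
    return reduce_phase(shuffle_phase(map_phase(events)))


if __name__ == "__main__":
    print(goals_per_team(EVENTS))
```

In a distributed setting, the map and reduce phases would run in parallel across worker nodes and the shuffle would move data between them; here everything runs in one process purely to illustrate the data flow.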