Home   >   CSC-OpenAccess Library   >    Manuscript Information
Full Text Available

This is an Open Access publication published under CSC-OpenAccess Policy.
Publications from CSC-OpenAccess Library are being accessed from over 158 countries worldwide.
Social Big Data: Techniques and Recent Applications
Tina Tian
Pages - 224 - 234     |    Revised - 30-11-2020     |    Published - 31-12-2020
Volume - 14   Issue - 5    |    Publication Date - December 2020  Table of Contents
Social Big Data, Big Data, Social Media Analytics, Social Media.
In the big data era, large volumes of social media data are generated at a high velocity, which we refer to as social big data. It is beyond the ability of traditional methods and algorithms to manage the massive amount of data in a tolerable elapsed time. In this paper, we present a comprehensive overview of the established big data techniques and new achievements on social big data management. The study also highlights a list of state-of-the-art applications based on data gathered from social networking platforms. At the end, we identify the key issues and challenges related to social big data analytics.
1 F. Atefeh and W. Khreich. “A Survey of techniques for event detection in Twitter.” Computer intelligence, vol. 31, no. 1, 2015.
2 D. Laney. “3D data management: controlling data volume, velocity, and variety.” META group research note, vol. 6, no. 70, 2001.
3 J. Manyika, M. Chui, B. Brown, J. Bughin, R. Dobbs, C. Roxburgh, and A. Hung Byers. “Big data: The next frontier for innovation, competition, and productivity.” McKinsey Global Institute, 2011.
4 A. McAfee and E. Brynjolfsson. “Big data. The management revolution.” Harvard business review, vol. 90, no. 10, pp. 60-68, 2012.
5 A. De Mauro, M. Greco, and M. Grimaldi. "A formal definition of big data based on its essential features." Library Review, 2016.
6 A. Gandomi and M. Haider, M. “Beyond the hype: big data concepts, methods, and analytics.” International Journal of Information Management, vol. 35, no. 2, pp. 137–144, 2015.
7 I, Ha, B. Back and B. Ahn, “MapReduce functions to analyze sentiment information from social big data.” International Journal of Distributed Sensor Networks, vol. 11, no. 6, pp. 417502, 2015.
8 MongoDB, https://www.mongodb.com/, retrieved on Oct. 31, 2020.
9 Cassandra, https://cassandra.apache.org/, retrieved on Oct. 31, 2020.
10 DynamoDB, https://aws.amazon.com/dynamodb/, retrieved on Oct. 31, 2020.
11 BigTable, https://cloud.google.com/bigtable, retrieved on Oct. 31, 2020.
12 Hadoop, http://hadoop.apache.org/, retrieved on Oct. 31, 2020.
13 S. Peng, G. Wang and D. Xie. "Social influence analysis in social networking big data: Opportunities and challenges." IEEE network, vol. 31, no. 1, pp. 11-17, 2016.
14 D. Lian and Y. Xiong. "Big data analytics and business analytics." Journal of Management Analytics, vol. 2, no. 1, pp. 1-21, 2015.
15 H. Akaike. “Fitting autoregressive models for prediction.” Annals of the institute of Statistical Mathematics, vol. 21, no. 1, pp. 243-247, 1969.
16 S.E. Said, and D.A. Dickey. “Testing for unit roots in autoregressive-moving average models of unknown order.: Biometrika, vol. 71, no. 3, pp. 599-607, 1984.
17 A. Fahad, N. Alshatri, Z. Tari, A. Alamri, I. Khalil, A.Y. Zomaya, S. Foufou, and A. Bouras. “A survey of clustering algorithms for big data: Taxonomy and empirical analysis.” IEEE transactions on emerging topics in computing, vol. 2 no. 3, pp.267-279, 2014.
18 C.H. Lee and T.F. Chien. “Leveraging microblogging big data with a modified density-based clustering approach for event awareness and topic ranking.” Journal of Information Science, vol. 39, no. 4, pp. 523–543, 2013.
19 A.E. Bryson. “Applied optimal control: optimization, estimation and control.” CRC Press, 1975.
20 T. Ohbe, O. Tadachika and S. Toramatsu. "A sentiment polarity classifier for regional event reputation analysis." In proceedings of the international conference on web intelligence, 2017.
21 L. Kuang, L.T. Yang and Y. Liao, “An integration framework on cloud for cyber physical social systems big data,” IEEE Transactions on Cloud Computing, 2015.
22 H. Liu, L.T. Yang, Y. Guo, X. Xie, J. Ma. “An incremental tensor-train decomposition for cyber-physical-social big data.” IEEE Transactions on Big Data, 2018.
23 C.K. Leung, H. Zhang. “Management of distributed big data for social networks.” In 2016 16th IEEE/ACM international symposium on cluster, cloud and grid computing, pp. 639-648, 2016.
24 D. Jin, X. Ma, Y. Zhang, H. Abbas and H. Yu. “Information diffusion model based on social big data.” Mobile networks and applications, vol. 23, no. 4, pp. 717-722, 2018.
25 A. Paul, A. Awais, R. Mazhar and J. Sohail. "Smartbuddy: defining human behaviors using big data analytics in social internet of things." IEEE Wireless communications vol. 23, no. 5, pp. 68-74, 2016.
26 A. Sandryhaila and J. Moura. "Big data analysis with signal processing on graphs: representation and processing of massive data sets with irregular structure." IEEE signal processing magazine, vol. 31, no. 5, pp. 80-90, 2014.
27 A. Vinay, S. Vinay, J. Rituparna, A. Tushar, K. Balasubramanya and S. Murthya. "Cloud based big data analytics framework for face recognition in social networks using machine learning." Procedia Computer Science, vol. 50, pp. 623-630, 2015.
28 A. Severyn and A. Moschitti. “Twitter sentiment analysis with deep convolutional neural networks.” In proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, pp. 959–962, 2015.
29 Z. Su, Q. Xu and Q. Qi. "Big data in mobile social networks: A QoE-oriented framework." IEEE network, vol. 30, no. 1, pp. 52-57, 2016.
30 S.K. Lakshmanaprabu, K. Shankar, A. Khanna, D. Gupta, J. Rodrigues, P.R. Pinheiro, and V. Albuquerque. “Effective features to classify big data using social internet of things.” IEEE access, vol. 6, pp. 24196-24204, 2018.
31 P. Wongthongtham and B.A. Salih. "Ontology-based approach for identifying the credibility domain in social Big Data." Journal of organizational computing and electronic commerce, vol. 28, no.4, pp. 354-377, 2018.
32 J. He and N. Xiong. "An effective information detection method for social big data." Multimedia tools and applications, vol. 77, no. 9, pp. 11277-11305, 2018.
33 A.I. Torre-Bastida, E. Villar-Rodriguez, J. Del Ser, and S. Gil-Lopez. “Semantic information fusion of linked open data and social big data for the creation of an extended corporate CRM database.” Intelligent distributed computing VIII, pp. 211-221, 2015.
34 J. Kim and M. Hastak. “Social Network Analysis: the role of social media after a disaster.” 10th anniversary homeland defense/security education summit, 2017.
35 J. Yin, S. Karimi, A. Lampert, M. Cameron, B. Robinson and R. Power. “Using social media to enhance emergency situation awareness.” International joint conference on artificial intelligence, pp. 4234–4239, 2015.
36 J. Kim and M. Hastak. "Social network analysis: Characteristics of online social networks after a disaster." International journal of information management, vol. 38, no. 1, pp. 86-96, 2018.
37 E. Yoo, W. Rand, M. Eftekhar, and E. Rabinovich. “Evaluating information diffusion speed and its determinants in social media networks during humanitarian crises.” Journal of operations management, vol. 45, pp. 123–133, 2016.
38 S. Choi and B. Bae. "The real-time monitoring system of social big data for disaster management." Computer science and its applications, pp. 809-815, 2015.
39 Z. Xu, Y. Liu, N. Yen, L. Mei, X. Luo, X. Wei and C. Hu. “Crowdsourcing based description of urban emergency events using social media big data.” IEEE Transactions on Cloud Computing, 2016.
40 X. Zheng, W. Chen, P. Wang, D. Shen, S. Chen, X. Wang, Q. Zhang and L. Yang. "Big data for social transportation." IEEE transactions on intelligent transportation systems, vol. 17, no. 3, pp. 620-630, 2015.
41 E. D'Andrea, P. Ducange, B. Lazzerini, and F. Marcelloni. “Real-time detection of traffic from twitter stream analysis.” IEEE transactions on intelligent transportation systems, vol. 16, no. 4, pp. 2269-2283, 2015.
42 Z. Zhou, C. Gao, C. Xu, Y. Zhang, S. Mumtaz, and J. Rodriguez. “Social big-data-based content dissemination in internet of vehicles.” IEEE transactions on industrial informatics, vol. 14, no. 2, pp. 768-777, 2017.
43 Y. Zhang, X. Ma, S. Wan, H. Abbas, and M. Guizani. “CrossRec: Cross-domain recommendations based on social big data and cognitive computing.” Mobile networks and applications, vol. 23, no. 6, pp. 1610-1623, 2018.
44 S. J. Miah, H.Q. Vu, J. Gammack and M. McGrath. “A big data analytics method for tourist behavior analysis.” Information and management, vol. 54, no. 6, pp. 771–785, 2016.
45 P. Del Vecchio, G. Mele, V. Ndou, and G. Secundo. “Creating value from social big data: Implications for smart tourism destinations.” Information processing & management, vol. 54, no. 5, pp. 847-860, 2018.
46 E. Marine-Roig and S.A. Clavé. “Tourism analytics with massive user-generated content: A case study of Barcelona.” Journal of destination marketing & management, vol. 4, no. 3, pp. 162–172, 2015.
47 P. Neirotti, E. Raguseo, and E. Paolucci. “Are customers’ reviews creating value in the hospitality industry? Exploring the moderating effects of market positioning.” International journal of information management, vol. 36, no. 6, pp. 1133–1143, 2016.
48 Z. Xiang, Z. Schwartz, J.H. Gerdes and M. Uysal. “What can Big Data and text analytics tell us about hotel guest experience and satisfaction?” International journal of hospitality management, vo. 44, pp. 120–130, 2015.
49 P. Del Vecchio, G. Mele, V. Ndou, and G. Secundo. “Open innovation and social big data for sustainability: evidence from the tourism industry.” Sustainability, vol. 10, no. 9, pp. 3215, 2018.
50 Y. Kim, C.K. Kim, D.K. Lee, H.W. Lee, and R.I.T. Andrada. “Quantifying nature-based tourism in protected areas in developing countries by using social big data.” Tourism management, vol. 72, pp. 249-256, 2019.
51 K. Yamada, H. Takayasu and M. Takayasu. "Estimation of economic indicator announced by government from social big data." Entropy, vol. 20, no. 11, pp. 852, 2018.
52 M. Spagnuolo, F. Maggi, F and S. Zanero, S. “Bitiodine: Extracting intelligence from the bitcoin network.” International conference on financial cryptography and data security, 2014.
53 Y. Zhang, Z. Li, C. Gao, K. Bian, L. Song, S. Dong, and X. Li. “Mobile social big data: Wechat moments dataset, network applications, and opportunities.” IEEE network, vol. 32, no. 3, pp. 146-153, 2018.
54 J. Song, T. Song, and J. Lee. "Stay alert: forecasting the risks of sexting in Korea using social big data." Computers in human behavior vol. 81, pp. 294-302, 2018.
55 M. Tsou and M. Leitner. “Visualization of social media: seeing a mirage or a message?” Cartography and geographic information science, vol. 40, no. 2, pp. 55–60, 2013.
56 L. Manovich. “Trending: the promises and the challenges of big social data.” Debates in the digital humanities, pp. 460–475, 2012.
57 A. Cuzzocrea. "Privacy and security of big data: current challenges and future research perspectives." In proceedings of the first international workshop on privacy and security of big data, 2014.
58 C. Uppin and A. Sudhir. "A comprehensive review for security analysis of IoT platforms." International journal of computer science and security, vol. 14, no. 4, 2020.
59 N. Almolhis, M. Haney, F. Alqhtani, and K. Makdi. "Discovering and understanding the security issues in IoT cloud." International journal of computer science and security, vol. 13, no. 6, 2019.
60 A. Neyaz, A. Kumar, S. Krishnan, J. Placker and Q. Liu. “Security, privacy and steganographic analysis of FaceApp and TikTok.” International journal of computer science and security, vol 14, no. 2, 2020.
61 Z. Su and Q. Xu. "Security-aware resource allocation for mobile social big data: A matching- coalitional game solution." IEEE transactions on big data, 2017.
62 X. Liang, K. Zhang, X. Shen and X. Lin, “Security and privacy in mobile social networks challenges and solutions.” IEEE wireless communications, vol. 21, no. 1, pp: 33-41, 2014.
63 A. Jeckmans, M. Beye, Z. Erkin, P. Hartel, R. Lagendijk and Q. Tang “Privacy in recommender systems,” Social media retrieval, pp. 263-281, 2013.
64 P. Zhou, Y. Zhou, D. Wu, and H. Jin. “Differentially private online learning for cloud-based video recommendation with multimedia big data in social networks.” IEEE transactions on multimedia, vol. 18, no. 6, pp. 1217-1229, 2016.
65 J. Hurlock and M. Wilson.” Searching Twitter: separating the tweet from the chaff.” In international AAAI conference on weblogs and social media, 2011.
66 K.B. Lee and J. Caverlee. “Seven months with the devils: A long-term study of content polluters on Twitter.” In international AAAI conference on weblogs and social media, 2011.
67 M.H. Tsou. “Research challenges and opportunities in mapping social media and big data.” Cartography and geographic information science, vol. 42, pp. 70-74, 2015.
68 D. Maynard, Diana, K. Bontcheva, and D. Rout. "Challenges in developing opinion mining tools for social media." In proceedings of the @ NLP can u tag# usergeneratedcontent, pp. 15-22, 2012.
69 D. Metzler, C. Cai and E.H. Hovy. “Structured event retrieval over microblog archives.” In proceedings of the 2012 conference of the north American chapter of the association for computational linguistics: human language technologies, pp. 646–655, 2012.
70 E. Hargittai. "Is bigger always better? Potential biases of big data derived from social network sites." The annals of the American academy of political and social science, vol. 659, no.1, pp. 63-76, 2015.
71 K. Crawford. “The hidden biases of big data.” Harvard business review, 2013.
72 T. Harford. “Big data: Are we making a big mistake?” FT magazine, 2014.
73 Z. Tufekci. "Big questions for social media big data: representativeness, validity and other methodological pitfalls." arXiv preprint arXiv:1403.7400, 2014.
74 A. Labrinidis and H. Jagadish, “Challenges and opportunities with big data,” In proceedings of the VLDB endowment, vol. 5, no.12, pp. 2032–2033, 2012.
75 N. Khan, I. Yaqoob, I. Hashem, Z. Inayat, W.K. Mahmoud Ali, M. Alam, M. Shiraz, and A. Gani. "Big data: survey, technologies, opportunities, and challenges." The scientific world journal, 2014.
Dr. Tina Tian
Department of Computer Science, Manhattan College, New York - United States of America