Open Access Open Access  Restricted Access Subscription or Fee Access

Preserving the Value of SMS Texting: A Survey on Mobile SMS Spam Classification Techniques and Algorithms

Khuteja Farha I. Kazi, Shweta C. Dharmadhikari

Abstract


Text classification concept can be realized in spam filtering for Email/SMS domain. There had been a variety of classification strategies used for email categorization including both supervised and unsupervised. SMS spam filtering is a new task comparatively to email spam filtering which inherits many issues and solutions from email spam filtering. The increase in the mobile phone users has lead to a substantial growth of SMS spam messages. Due to increase in use of Short Message Service (SMS) over mobile phones because of cheap and bulk pre-pay sms packages , there has been a burst of spam SMSes. Along with the range of classification algorithms the use of SMS features is also varied accordingly from non-content based to content based. In this survey paper the previous attempts in SMS spam domain and recent developments in SMS spam filtering have been summarized that contributes in preserving the value of SMS by reducing the spamminess of the SMS. Some of the algorithms that contributed in SMS spam filtering and their comparison is discussed. Further SVM based spam filtering system is proposed, that classifies spam and non-spam SMSes. This filtering system can further classify non spam SMSes in predefined categories that can be viewed in different tabs for comparatively easier and faster user access to a particular SMS.

Keywords


Text Classification, Spam Filtering, SMS Classification, SMS Spam.

Full Text:

PDF

References


Sarah Jane Delany, Mark Buckley, and Derek Greene, “Review: SMS spam filtering: Methods and data ,”. Expert Syst. Appl. 39, 10, pp. 9899-9908 ,August 2012.

Qian Xu; Xiang, E.W.; Qiang Yang; Jiachun Du; JiepingZhong, "SMS Spam Detection Using Noncontent Features," Intelligent Systems, IEEE , vol.27, no.6, pp.44,51, Nov.-Dec. 2012

Thiago S. Guzella, Walmir M. Caminhas, “A review of machine learning approaches to Spam filtering,” Expert Systems with Applications, Tarrytown, NY, USA, Volume 36, Issue 7, pp. 10206--10222, September 2009.

Chih-Chin Lai, “An empirical study of three machine learning methods for spam filtering,” Knowledge-Based Systems, vol. 20, Issue 3, , pp. 249-254, April 2007.

Drucker, H.; Wu, S.; Vapnik, V.N., "Support vector machines for spam categorization," Neural Networks, IEEE Transactions on , vol.10, no.5, pp.1048,1054, Sep 1999

José María Gómez Hidalgo, Guillermo CajigasBringas, Enrique PuertasSánz, and Francisco CarreroGarcía , “Content based SMS spam filtering ,” In Proceedings of the 2006 ACM symposium on Document engineering (DocEng '06). ACM, New York, NY, USA, pp. 107-114 ,2006.

Belem, D.; Duarte-Figueiredo, F., "Content filtering for SMS systems based on Bayesia8.n classifier and word grouping," Network Operations and Management Symposium (LANOMS), 2011 7th Latin American , vol., no., pp.1,7, 10-11 Oct. 2011

JieCai; Yuezhong Tang; Rile Hu, "Spam Filter for Short Messages Using Winnow," Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on , vol., no., pp.454,459, 23-25 July 2008

Kuldeep Yadav, PonnurangamKumaraguru, AtulGoyal, Ashish Gupta, and VinayakNaik , “SMSAssassin: crowdsourcing driven mobile-based system for SMS spam filtering,” In Proceedings of the 12th Workshop on Mobile Computing Systems and Applications (HotMobile '11). ACM, New York, NY, USA pp. 1-6 , 2011.

Uysal, A.K.; Gunal, S.; Ergin, S.; Gunal, E.S., "A novel framework for SMS spam filtering," Innovations in Intelligent Systems and Applications (INISTA), 2012 International Symposium on , vol., no., pp.1,4, 2-4 July 2012

Gordon V. Cormack, José María Gómez Hidalgo, and Enrique PuertasSán, “ Feature engineering for mobile (SMS) spam filtering,” In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '07). ACM, New York, NY, USA, pp. 871-872 ,2007

Mathew, K.; Issac, B., "Intelligent spam classification for mobile text message," Computer Science and Network Technology (ICCSNT), 2011 International Conference on , vol.1, no., pp.101,105, 24-26 Dec. 2011.

Wei-Wei Deng; Hong Peng, "Research on a Naive Bayesian Based Short Message Filtering System," Machine Learning and Cybernetics, 2006 International Conference on , vol., no., pp.1233,1237, 13-16 Aug. 2006

Ikonomakis, M., S. Kotsiantis, and V. Tampakas. "Text classification using machine learning techniques." WSEAS Transactions on Computers 4.8 , pp. 966-974 ,2005

K., Uysal A., S., Gunal, S., Ergin and Sora, GunalE.. "The Impact of Feature Extraction and Selection on SMS Spam Filtering," Electronics & Electrical Engineering 19 , no. 5, pp. 67 - 72, 2013.

Ningning Wu; Mingguang Wu; Siguo Chen, "Real-time monitoring and filtering system for mobile SMS," Industrial Electronics and Applications, 2008. ICIEA 2008. 3rd IEEE Conference on , vol., no., pp.1319,1324, 3-5 June 2008Contributions to the study of SMS Spam Filtering: New Collection and Results.

DuanLongzhen; Li An; Huang Longjun, "A New Spam Short Message Classification," Education Technology and Computer Science, 2009. ETCS '09. First International Workshop on , vol.2, no., pp.168,171, 7-8 March 2009

H. Wenliang, Z. Ni, and D. Yutao, “Spam SMS Detection Based on Mobile Terminal Location and Content,” Mobile Communications, vol. 32, no. 13, 2008, pp. 70–74 (in Chinese).

G.V. Cormack, J. María Gómez Hidalgo, and E. PuertasSánz, “Spam Filtering for Short Messages,” Proc. ACM Conf. Information and Knowledge Management (CIKM 07), ACM, NY, USA, pp. 313–320 ,2007.

Akshay Narayan and PrateekSaxena , “ The curse of 140 characters: evaluating the efficacy of SMS spam detection on android,” In Proceedings of the Third ACM workshop on Security and privacy in smartphones & mobile devices (SPSM '13), ACM, New York, NY, USA, pp. 33-42, 2013

Muhammad Bilal Junaid and Muddassar Farooq, “ Using evolutionary learning classifiers to do MobileSpam (SMS) filtering,” In Proceedings of the 13th annual conference on Genetic and evolutionary computation (GECCO '11) ACM, NatalioKrasnogor (Ed.), NY, USA, pp. 1795-1802,2011

Ji Won Yoon, Hyoungshick Kim, Jun Ho Huh, “Hybrid spam filtering for mobile communication,” Computers & Security, Vol. 29, Issue 4, pp. 446-459, June 2010.

http://en.wikipedia.org/wiki/Support_vector_machine

http://en.wikipedia.org/wiki/Naive_Bayes_classifier.

http://en.wikipedia.org/wiki/K-nearest_neighbors_algorith

http://nlp.stanford.edu/IR-book/html/htmledition/support-vector-machines-the-linearly-separable-case-1.htm


Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.