Measure Effectiveness of SMS Spam Detection Model Based on Machine Learning Techniques

Authors

  • Ahmed Hamed Osman ,
  • Muhammad Badawi Al-Khalifa ,

DOI:

https://doi.org/10.26389/AJSRP.N020123

Keywords:

Accuracy, Classification, Confusion Matrix, Dataset, ham, Natural Language Processing

Abstract

With the increase in the use of mobile phones, the use of Short Message Service has increased exponentially. With the cost of text messages dropping, people started using them for promotional purposes and unethical activities. This led to a massive increase in spam and consequently the loss of personal and financial data. To prevent data loss, it is essential that spam is detected as quickly as possible. Thus, this paper aims to classify spam not only effectively but also in a short time using python. A dataset containing thousands of text messages containing natural messages (ham) and spam messages was used. Natural language processing techniques were used Multiomail Naive Bayes, Decision Tree and Random Forest are used through which we can classify the message type. After applying these algorithms, Random Forest algorithm got the best accuracy 0.99% in 0.15 second.

Author Biographies

  • Ahmed Hamed Osman, ,

    College of Computer Science and Information Technology | Mashreq University | Sudan

  • Muhammad Badawi Al-Khalifa, ,

    College of Computer Science and Information Technology | Mashreq University | Sudan

References

Downloads

Published

2023-03-30

Issue

Section

Content

How to Cite

Osman, A. H., & Al-Khalifa, M. B. (2023). Measure Effectiveness of SMS Spam Detection Model Based on Machine Learning Techniques. Journal of Engineering Sciences and Information Technology, 7(1), 58-68. https://doi.org/10.26389/AJSRP.N020123