Spam SMS messages often resemble legitimate (ham) messages but carry threats or malicious links designed to steal personal information such as bank account and passport details. Many mobile users have lost large sums of money after unknowingly opening spam messages and submitting information through the links they contain. Many industries are working to keep such messages out of users' inboxes using machine learning and deep learning techniques. These techniques require sufficient training data for a model to perform well in real-time use. Benchmark spam SMS datasets are now readily available for English, but far fewer exist for Indian languages. This paper provides an overview of the SMSDHL dataset, which covers the Dravidian languages Tamil, Telugu, Kannada, and Malayalam, together with Hindi and English. The dataset can be used for various applications in Information Extraction, Information Retrieval, and Natural Language Processing (NLP).