AUTHOREA
Log in Sign Up Browse Preprints
LOG IN SIGN UP
Venkatesh Koreddi
Venkatesh Koreddi

Public Documents 2
Enhancing Text Summarization and Audio Generation Using Hybrid Model
Venkatesh Koreddi
Shaik Chandini

Venkatesh Koreddi

and 3 more

November 06, 2024
In today's fast-paced world, efficiently processing the vast amount of information in lengthy documents poses significant challenges. Traditional methods of reading and understanding multi-page documents, often spanning six to seven pages, are time-consuming and cumbersome for students, scholars, and professionals who face constant time constraints. This project introduces a novel approach by integrating advanced text summarization with real-time audio generation. Unlike existing solutions, which either focus on summarization or audio conversion separately, this dual-system solution uniquely combines both, allowing users to quickly grasp key content and consume it in audio form while on the move. The summarization component effectively condenses large documents into concise summaries, preserving the critical information and context. Meanwhile, the audio generation component transforms these summaries into speech, enhancing accessibility for auditory learners and users on the go. This innovative combination not only saves time but also offers flexibility, catering to diverse user preferences and learning styles.
Multilingual AI System for Detecting Offensive Content Across Text, Audio, and Visual...
Venkatesh Koreddi
Nalluri Manisha

Venkatesh Koreddi

and 3 more

November 06, 2024
This project aims to develop an AI system with advanced capabilities to detect offensive language across diverse platforms—covering text, audio (both live and recorded speech), and images (such as memes)—in multiple languages. By leveraging technologies like Natural Language Processing (NLP), Speech Recognition (SR), and Optical Character Recognition (OCR) for identifying text within images, the system can already flag potentially harmful or inappropriate content. Integration with Google Translator ensures automatic detection and translation of input languages, enabling global applicability and enhanced reliability. For text analysis, the system utilizes BERT (Bidirectional Encoder Representations from Transformers), a large, pre-trained model known for its strong contextual and semantic comprehension of human language. As the digital landscape rapidly evolves, precise identification of offensive content is becoming ever more essential. Through this project, we are building robust, fair, and impactful technology to foster safer online environments for all users, addressing this significant challenge head-on.

| Powered by Authorea.com

  • Home