AUTHOREA
Log in Sign Up Browse Preprints
LOG IN SIGN UP
Srujan Murthy
Srujan Murthy
COO at CSX Labs
Celina, Tx

Public Documents 5
Sapphire: An NLP-based YouTube video scoring model
Srujan Murthy

Srujan Murthy

June 13, 2024
This whitepaper introduces Sapphire, a novel ranking model designed to evaluate YouTube videos based on the comprehensive analysis of their transcripts' corpus. By employing regex operations and identifying the most unique and significant keywords throughout the video content, Sapphire offers a more analytical approach to evaluation, considering the relative importance of individual terms. The primary objective of Sapphire is to address the challenges associated with ranking transcripts based on their rigor, independently of video viewership, which is the conventional approach adopted by the YouTube Watch Time algorithm[1]. Additionally, Sapphire includes transcription based on unique identifier keyword weighting strategies. This paper details Sapphire, exploring key components such as YouTube transcription, text preprocessing, Term Frequency-Inverse Document Frequency (TF-IDF) evaluators, and score assessments.
A probabilistic method to rigor based text scoring
Srujan Murthy

Srujan Murthy

November 26, 2024
This paper introduces a novel method for evaluating the rigor of written text using Markov chain Monte Calro simulations. Traditional approaches to text analysis, such as readability metrics or grammatical checks, often fall short in assessing the logical consistency, structural complexity, and overall coherence that characterize rigorous writing[SWSK24]. I propose a probabilistic model that leverages Markov chains to quantify text rigor by analyzing word and phrase transitions within the text. Specifically, we model each word or n-gram as a state within a Markov chain and compute transition probabilities that capture patterns of consistency and complexity[Scind].
LanePro: A new approach towards Lane assistance
Srujan Murthy

Srinidhi Srujan Murthy

August 01, 2024
Lane detection is a critical component of autonomous driving systems, enabling vehicles to identify and navigate within lanes accurately. This paper presents a novel approach to enhancing lane detection ac curacy using the Mask R-CNN algorithm. By leveraging the capabilities of Mask R-CNN, the proposed algorithm demonstrates efficient and precise detection of road lanes, including the classification of lane types and angle evaluation for steer ing purposes. The algorithm’s functionality encompasses determining bounding boxes of lanes, angle evaluation through image cropping, classification, and lane data configuration for schematic environ mental surveillance. Through extensive testing, the algorithm has shown superior performance in scenarios with challenging conditions such as insufficient lighting and lane line degradation. The results indicate a significant improvement in lane detection accuracy, making it a promising solution for advancing the capabilities of autonomous driving systems.  
Sapphire and the Aho-Corasick algorithm
Srujan Murthy

Srinidhi Srujan Murthy

July 22, 2024
This paper explores the implications of the Aho-Corasick algorithm in enhancing text analysis in the Sapphire algorithm. The Aho-Corasick algorithm is pivotal in Sapphire's functionality, facilitating the assignment of tokens to their respective values on the frequency distribution and extracting their corresponding TF-IDF scores. This integration significantly improves the efficiency of text evaluation, underscoring the essential role of the Aho-Corasick algorithm.
Sapphire vs Conventional YouTube Video Ranking
Srujan Murthy

Srujan Murthy

and 1 more

June 21, 2024
This paper introduces how Sapphire varies from the standard conventional methodologies used for ranking YouTube Videos. By implementing Natural Language Processing, specifically Computer Linguistics, Sapphire enhances YouTube Video ranking by specifically weighing videos based on how much content they covers rather than their Statistics. The conventional Watch Time algorithm ranks videos based on features such as viewership status, likes, subscriptions, etc [Fyf20]. This Whitepaper delves into what makes Sapphire distinct and domain specific from the Watch Time Algorithm.

| Powered by Authorea.com

  • Home