AUTHOREA
Log in
Sign Up
Browse Preprints
LOG IN
SIGN UP
Essential Site Maintenance
: Authorea-powered sites will be updated circa 15:00-17:00 Eastern on Tuesday 5 November.
There should be no interruption to normal services, but please contact us at help@authorea.com in case you face any issues.
Nong Xiao
Public Documents
1
High-Frequency K-mer Counting at Low Memory Footprint
Li Mocheng
and 3 more
July 24, 2022
Genomics data analysis requires efficient tools to address the vast amount of data generated by current next-generation sequencing technologies. K-mer counting works face difficulties in balancing high memory overhead with statistical precision. We designed a high-frequency k-mer statistical computation based on the Space Saving algorithm and a novel hash table structure, which reduces the memory overhead by 46\% while ensuring high computational efficiency.