AUTHOREA
Log in Sign Up Browse Preprints
LOG IN SIGN UP
Meng Chaolu
Meng Chaolu

Public Documents 1
IIFS2.0: An Improved Incremental Feature Selection Method for Protein Sequence Proces...
Meng Chaolu
Yue Pei

Meng Chaolu

and 6 more

April 14, 2024
The purpose of feature selection in protein sequence recognition problems is to select the optimal feature set and use it as training input for classifiers and discover key sequence features of specific proteins. In the feature selection process, relevant features associated with the target task will be retained, and irrelevant and redundant features will be removed. Therefore, in an ideal state, a feature combination with smaller feature dimensions and higher performance indicators is desired. This paper proposes an algorithm called IIFS2.0 based on the cache elimination strategy, which takes the local optimal combination of cached feature subsets as a breakthrough point. It searches for a new feature combination method through the cache elimination strategy to avoid the drawbacks of human factors and excessive reliance on feature sorting results. We validated and analyzed its effectiveness on the protein dataset, demonstrating that IIFS2.0 significantly reduces the dimensionality of feature combinations while also improving various evaluation indicators. In addition, we provide IIFS2.0 on http://112.124.26.17:8006/ for researchers to use.

| Powered by Authorea.com

  • Home