Home
DNA
RNA
Protein
Input Dataset

Enter DNA sequences (FASTA format):

Enter data in FASTA format. The title includes id, label (0 or 1), and train/test.  Sequence lengths need to be consistent.

Upload Dataset File:

Only .txt, .fa, or .fasta files are allowed, up to 3MB.
Feature Extraction
File input   (The feature vectors can be uploaded here:(Scikit-learn format)) Download Example File
  1. All submitted files must be in CSV format.
  2. The training dataset (train_data) and testing dataset (test_data) must have the same data dimensions.
  3. The number of samples in the training dataset (train_data) must match the number of samples in the training label set (train_labels), and the number of samples in the testing dataset (test_data) must match the number of samples in the testing label set (test_labels).
  4. The label file (labels) should contain only a single column of data, and all entries must be of integer type.
  5. The label file must contain at least two distinct labels.