Home
DNA
RNA
Protein
Input Dataset
Enter DNA sequences (FASTA format):
input dataset
Enter data in FASTA format. The title includes id, label (0 or 1), and train/test. Sequence lengths need to be consistent.
Load Example
Clear
Upload Dataset File:
Select a file:
Only .txt, .fa, or .fasta files are allowed, up to 3MB.
Feature Extraction
DAC
DACC
DCC
GAC
IDKmer
Kmer
MAC
NMBAC
One-hot
RevKmer
TAC
TCC
TACC
NMBAC
GAC
MAC
Submit
File input
(The feature vectors can be uploaded here:(Scikit-learn format))
Download Example File
train_data:
train_labels:
test_data:
test_labels:
All submitted files must be in
CSV
format.
The training dataset (
train_data
) and testing dataset (
test_data
) must have the same data dimensions.
The number of samples in the training dataset (
train_data
) must match the number of samples in the training label set (
train_labels
), and the number of samples in the testing dataset (
test_data
) must match the number of samples in the testing label set (
test_labels
).
The label file (
labels
) should contain only a single column of data, and all entries must be of
integer
type.
The label file must contain at least
two distinct labels
.
Submit