Input Dataset
Enter protein sequences (FASTA format):
Enter data in FASTA format. The title (header) line of each sequence must include its id, its label (0 or 1), and whether it belongs to the train or test set.
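As an illustration of the input described above, the following Python sketch parses labeled FASTA text. The page does not state how the id, label, and train/test fields are separated in the title line, so the `>id|label|split` layout, the `|` separator, and the function name `parse_labeled_fasta` are assumptions made for this example only; use "Load Example" to see the layout the server actually expects.

```python
from typing import Dict, List


def parse_labeled_fasta(text: str, sep: str = "|") -> List[Dict[str, object]]:
    """Parse FASTA text whose title lines are ASSUMED to look like '>id|label|split'."""
    records: List[Dict[str, object]] = []
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            fields = [field.strip() for field in line[1:].split(sep)]
            if len(fields) != 3:
                raise ValueError(f"expected id{sep}label{sep}split, got: {line!r}")
            seq_id, label, split = fields
            if label not in ("0", "1"):
                raise ValueError(f"label must be 0 or 1, got: {label!r}")
            split = split.lower()
            if split not in ("train", "test"):
                raise ValueError(f"split must be train or test, got: {split!r}")
            current = {"id": seq_id, "label": int(label), "split": split, "sequence": ""}
            records.append(current)
        else:
            if current is None:
                raise ValueError("sequence data found before the first title line")
            # Sequences may span several lines; join them into one string.
            current["sequence"] += line
    return records
```

A call such as `parse_labeled_fasta(">P1|1|train\nMKV\nLA\n")` yields one record whose sequence is the two lines joined together.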
Load Example
Clear
Upload Dataset File:
Select a file:
Only .txt, .fa, or .fasta files up to 3 MB are accepted.
Feature Extraction
AAC
CHHAA
One-hot
Kmer
DR
Distance Pair
AC
CC
ACC
PDT
PC-PseAAC-General
SC-PseAAC-General
PC-PseAAC
SC-PseAAC
Top-n-gram
PDT-Profile
DT
AC-PSSM
CC-PSSM
ACC-PSSM
Submit
File input
(Precomputed feature vectors can be uploaded here in scikit-learn format.)
Download Example File
train_data:
train_labels:
test_data:
test_labels:
All submitted files must be in CSV format.
The training dataset (train_data) and testing dataset (test_data) must have the same data dimensions.
The number of samples in the training dataset (train_data) must match the number of samples in the training label set (train_labels), and the number of samples in the testing dataset (test_data) must match the number of samples in the testing label set (test_labels).
The label file (labels) should contain only a single column of data, and all entries must be of integer type.
The label file must contain at least two distinct labels.
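The rules above can be checked locally before uploading. The sketch below is not the server's own validation; it is a minimal Python example under several assumptions: the CSV files have no header row, "same data dimensions" means the same number of columns, and the two-distinct-labels rule applies to each label file. The helper names `read_csv_rows`, `column_count`, `check_labels`, and `check_submission` are hypothetical.

```python
import csv
from typing import List, Optional

Rows = List[List[str]]


def read_csv_rows(path: str) -> Rows:
    """Read a CSV file into a list of rows, skipping empty lines (no header assumed)."""
    with open(path, newline="") as handle:
        return [row for row in csv.reader(handle) if row]


def column_count(rows: Rows, name: str, problems: List[str]) -> Optional[int]:
    """Return the shared column count, or record a problem and return None."""
    widths = {len(row) for row in rows}
    if len(widths) != 1:
        problems.append(f"{name}: must be non-empty with a consistent column count")
        return None
    return widths.pop()


def check_labels(rows: Rows, name: str) -> List[str]:
    """Check the single-column, integer, and two-distinct-labels rules."""
    if any(len(row) != 1 for row in rows):
        return [f"{name}: must contain exactly one column"]
    try:
        values = [int(row[0]) for row in rows]
    except ValueError:
        return [f"{name}: every entry must be an integer"]
    if len(set(values)) < 2:
        return [f"{name}: must contain at least two distinct labels"]
    return []


def check_submission(train_data: Rows, train_labels: Rows,
                     test_data: Rows, test_labels: Rows) -> List[str]:
    """Return a list of rule violations; an empty list means all checks passed."""
    problems: List[str] = []
    n_train = column_count(train_data, "train_data", problems)
    n_test = column_count(test_data, "test_data", problems)
    if n_train is not None and n_test is not None and n_train != n_test:
        problems.append("train_data and test_data must have the same number of columns")
    if len(train_data) != len(train_labels):
        problems.append("train_data and train_labels must have the same number of samples")
    if len(test_data) != len(test_labels):
        problems.append("test_data and test_labels must have the same number of samples")
    for name, rows in (("train_labels", train_labels), ("test_labels", test_labels)):
        problems.extend(check_labels(rows, name))
    return problems
```

To use it, load each of the four files with `read_csv_rows` and pass the results to `check_submission`; any returned messages point to the rule that was violated.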
Submit