[Back] [FAQ] Fasta dataset splitter - help page
 

This tool will simply split your data into two sets:

  1. All sequences containing the keyword in the header
  2. The rest of the sequences
Example:
>seq1|Male|Pop1
AGCATGCTAGCTAGTCGACTCGATCGCATGCTC
>seq2|Male|Pop1
AGCATGCTAGCTAGACGACTCGATCGCATGCTC
>seq3|Female|Pop2
AGCATGCTAGCTAGTCGACTCGATCGCATGCTC
>seq4|Female|Pop3
AGCATGCTACCTAGTCGACTCGATCGCATGCTC
>seq5|Male|Pop4
AGCATGCTAGCTAGTCGACTCGATGGCATGCTC

If you choose the keyword 'Female' it will make two sets, the first with the two sequences containing the term 'female'.

'Pop1' will give you two sequences in the first set and the remaining two sequences in the second set.

NOTE: if you choose 'male' you will only get one set, since all sequence headers contain the word 'male' (feMALE), here the correct term would be '|male|' - to get the males.