A set of various short scripts written in Python and used for sequence manipulation, gene ontology manipulation and such.
If you use those scripts and find any error, please report it!
See the README file in the gene_ontology folder.
uniq-fasta.py
Script to report unique fasta sequences from one or several fasta files. Sequences are reported in the fasta format, using a hash of the sequence as the sequence name.
Example usage:
# For one input file python uniq-fasta.py mySeq.fasta -o myUniqSeq.fasta cat mySeq.fasta | python uniq-fasta.py > myUniqSeq.fasta # For several input files cat *.fasta | python uniq-fasta.py > myUniqSeq.fasta
Those tools are in the sequences folder. Each script performs a very simple task.