|
1
|
Diamond 2.0
|
Used for scanning unassembled read sets for small DNA tumor virus hallmark protein sequences.
|
38712252
|
View Related Publication
|
|
2
|
SRA Toolkit 2.9.6 (fastq-dump module)
|
Used for initial sequence downloads with settings to split files and gzip.
|
|
3
|
SRA Toolkit 3.0.3 (fasterq-dump module)
|
Used for recent assembly efforts to download reads faster.
|
|
4
|
Seqtk
|
Used to sample 50 million read pairs.
|
|
5
|
fastp
|
Used to quality-trim reads.
|
|
6
|
Megahit 1.2.9
|
Used to assemble trimmed reads with a specified minimum contig length.
|
|
7
|
CLC Genomics Workbench 22
|
Used for importing de novo-assembled contig sequences and converting them into BLAST databases.
|
|
8
|
Cenote Taker 2
|
Used for virus genome detection and gene annotation.
|
|
9
|
Cenote Taker 3
|
Used for auto-annotation of virus groups outside the scope of the current survey.
|
|
10
|
MacVector 18
|
Used for compiling annotations and displaying maps.
|
|
11
|
HHpred
|
Used for protein structure prediction analysis.
|
|
12
|
Phyre2
|
Used for protein structure prediction.
|
|
13
|
Dali
|
Used for structural comparisons using AlphaFold2 or RoseTTAfold predictions.
|
|
14
|
EMBOSS getorf
|
Used for analyzing protein sequence clusters.
|
|
15
|
EFI-EST
|
Used for enzyme function discovery and metabolic pathway analysis.
|
|
16
|
PhyML
|
Used for inferring phylogenetic trees.
|
|
17
|
KnotInFrame
|
Used for predicting programmed -1 ribosomal frameshift slippery sequences.
|
|
18
|
Cytoscape
|
Used for visualizing network analyses of viral gene content.
|
|
19
|
FigTree software
|
Used for viewing phylogenetic trees.
|