Tools
######
`FastQ Quality Scores `_
Assembler
==========
Assember...
Flye
-----
Flye is a de novo assembler.
.. highlight:: bash
:linenothreshold: 1
bash::
conda install -c bioconda flye
flye --nano-raw \
~/course_data/precompiled/all_guppy.fastq \
--genome-size 1m --out-dir ./flye_output
* `Flye github `_
* `Flye youtube `_
* `Flye tutorial `_
miniasm
--------
Miniasm is a very fast OLC-based de novo assembler for noisy long reads.
.. highlight:: bash
:linenothreshold: 1
miniasm::
conda install -c bioconda miniasm
* `miniasm github `_
* `miniasm youtube `_
quickmerge
===========
Quickmerge will even work with hybrid assemblies made by combining long reads and Illumina short reads.
.. highlight:: bash
:linenothreshold: 1
quickmerge::
conda install -c conda-forge -c bioconda quickmerge
# using wrapper
merge_wrapper.py hybrid_assembly.fasta self_assembly.fasta
Trimmer
=========
Nanofilt
--------
Trimming and filtering Oxford Nanopore sequencing reads.
.. highlight:: bash
:linenothreshold: 1
Nanofilt::
NanoFilt –l 500 --headcrop 10 -q 10 < ./Q5705/data/${sample}_Blockchain_v6.0.1.fastq > ./Q5705/trimmed/${sample}_Blockchain_v6.0.1.trimmed.fastq
* `Nanofilt github `_
Porechop
--------
Porechop is a tool for finding and removing adapters from Oxford Nanopore reads.
.. highlight:: bash
:linenothreshold: 1
Porechop::
conda install -c bioconda porechop
porechop -i input_reads.fastq.gz -o output_reads.fastq.gz
* `Porechop github `_
Graph
======
VG
-----
* `VG CPANG 2018 `_
* `VG CPANG 20192020 `_
minigraph
----------
* `minigraph tutorial `_
pggb
----------
* `pggb github `_
Alignment
==========
minimap2
--------
* `Options [kr] `_
* `Nanopore sequencing de novo assembly [kr] `_
* `Genome Assembly with Minimap2 and Miniasm `_
* `Man page `_
paftools
^^^^^^^^
paftools.js is a script that processes alignments in the PAF format, such as converting between formats, evaluating mapping accuracy, lifting over BED files based on alignment, and calling variants from assembly-to-assembly alignment. This script requires the k8 Javascript shell to run. On Linux or Mac, you can download the precompiled k8 binary with:
Winnowmap
---------
Winnowmap is a long-read mapping algorithm optimized for mapping ONT and PacBio reads to repetitive reference sequences. Winnowmap development began on top of minimap2 codebase, and since then we have incorporated the following two ideas to improve mapping accuracy within repeats.
* `Winnowmap github `_
MUMmer
------
`Options `_
dnadiff
^^^^^^^
`dnadiff readme `_
Consensus
==========
Racon
------
Racon is a standalone consensus building tool that can be coupled with a fast assembler such as miniasm, which performs de novo assembly with error prone long reads without error corrections. This dramatically cut down the time needed for sequence assembly and consensus generation. Racon stands for Rapid Consensus and it can be used for PacBio and Oxford Nanopore data.
* `Racon github `_
Graph
======
Bandage
-------
`How to Use Assembly Graphs with Metagenomic Datasets `_
SAM/BAM
=========
Samtools
---------
* `Samtools offical doc `_
* `Samtools How to use `_
.. highlight:: bash
:linenothreshold: 1
bash::
samtools view -S –b ${sample}_trimmed.sam > ${sample}_trimmed.bam
samtools sort –o ${sample}_trimmed.sorted.bam ${sample}_trimmed.bam
samtools index ${sample}_trimmed.sorted.bam
# get the total number of reads of a BAM file (may include unmapped and duplicated multi-aligned reads)
samtools view -c SAMPLE.bam
# counting only mapped (primary aligned) reads
samtools view -c -F 260 SAMPLE.bam
SV Caller
=========
CuteSV
---------
* `cuteSV github `_
Option
.. highlight:: none
.. code-block:: rst
> For ONT data:
--max_cluster_bias_INS 100
--diff_ratio_merging_INS 0.3
--max_cluster_bias_DEL 100
--diff_ratio_merging_DEL 0.3