0 | 0 |
[](https://travis-ci.org/conchoecia/pauvre) [](https://zenodo.org/badge/latestdoi/112774670)
|
1 | 1 |
|
|
2 |
## <a name="started"></a>Getting Started
|
2 | 3 |
|
3 | |
## pauvre: a plotting package designed for nanopore and PacBio long reads
|
|
4 |
```
|
|
5 |
pauvre custommargin -i custom.tsv --ycol length --xcol qual # Custom tsv input
|
|
6 |
```
|
4 | 7 |
|
5 | |
This package currently hosts four scripts for plotting and/or printing stats.
|
|
8 |
## Table of Contents
|
|
9 |
|
|
10 |
- [Getting Started](#started)
|
|
11 |
- [Users' Guide](#uguide)
|
|
12 |
- [Installation](#installation)
|
|
13 |
- [Requirements](#reqs)
|
|
14 |
- [Install Instructions](#install)
|
|
15 |
- [Usage](#usage)
|
|
16 |
- [pauvre stats](#stats)
|
|
17 |
- [pauvre marginplot](#marginplot)
|
|
18 |
- [Basic Usage](#marginbasic)
|
|
19 |
- [Plot Adjustments](#marginadjustments)
|
|
20 |
- [Specialized Options](#marginspecialized)
|
|
21 |
- [Contributors](#contributors)
|
|
22 |
|
|
23 |
## <a name="uguide"></a>Users' Guide
|
|
24 |
|
|
25 |
Pauvre is a plotting package originally designed to help QC the length and
|
|
26 |
quality distribution of Oxford Nanopore or PacBio reads. The main outputs
|
|
27 |
are marginplots. Now, `pauvre` also hosts other additional data plotting
|
|
28 |
scripts.
|
|
29 |
|
|
30 |
This package currently hosts five scripts for plotting and/or printing stats.
|
6 | 31 |
|
7 | 32 |
- `pauvre marginplot`
|
8 | 33 |
- takes a fastq file as input and outputs a marginal histogram with a heatmap.
|
|
34 |
- `pauvre custommargin`
|
|
35 |
- takes a tsv as input and outputs a marginal histogram with custom columns of your choice.
|
9 | 36 |
- `pauvre stats`
|
10 | 37 |
- Takes a fastq file as input and prints out a table of stats, including how many basepairs/reads there are for a length/mean quality cutoff.
|
11 | 38 |
- This is also automagically called when using `pauvre marginplot`
|
|
23 | 50 |
genomes with the fewest crossings-over. Input is one `.gff` file
|
24 | 51 |
per circular genome and one directory of gene alignments.
|
25 | 52 |
|
26 | |
# Requirements
|
|
53 |
|
|
54 |
## <a name="installation"></a>Installation
|
|
55 |
|
|
56 |
### <a name="reqs"></a>Requirements
|
27 | 57 |
|
28 | 58 |
- You must have the following installed on your system to install this software:
|
29 | 59 |
- python 3.x
|
|
32 | 62 |
- pandas
|
33 | 63 |
- pillow
|
34 | 64 |
|
35 | |
# Installation
|
36 | |
|
|
65 |
### <a name="install">Install Instructions
|
37 | 66 |
- Instructions to install on your mac or linux system. Not sure on
|
38 | 67 |
Windows! Make sure *python 3* is the active environment before
|
39 | 68 |
installing.
|
|
43 | 72 |
- Or, install with pip
|
44 | 73 |
- `pip3 install pauvre`
|
45 | 74 |
|
46 | |
# Usage
|
|
75 |
## <a name="usage"><a/>Usage
|
47 | 76 |
|
48 | |
## `stats`
|
|
77 |
### <a name="stats"></a>`stats`
|
49 | 78 |
- generate basic statistics about the fastq file. For example, if I
|
50 | 79 |
want to know the number of bases and reads with AT LEAST a PHRED
|
51 | 80 |
score of 5 and AT LEAST a read length of 500, run the program as below
|
|
77 | 106 |
et cetera...
|
78 | 107 |
```
|
79 | 108 |
|
80 | |
## `marginplot`
|
|
109 |
### <a name="marginplot"></a>`marginplot`
|
81 | 110 |
|
82 | |
### Basic usage
|
|
111 |
#### <a name="marginbasic"></a>Basic Usage
|
83 | 112 |
- automatically calls `pauvre stats` for each fastq file
|
84 | 113 |
- Make the default plot showing the 99th percentile of longest reads
|
85 | 114 |
- `pauvre marginplot --fastq miniDSMN15.fastq`
|
|
89 | 118 |
- `pauvre marginplot --maxlen 4000 --maxqual 25 --lengthbin 50 --fileform pdf png --qualbin 0.5 --fastq miniDSMN15.fastq`
|
90 | 119 |
- 
|
91 | 120 |
|
92 | |
### Filter reads and adjust viewing window
|
|
121 |
#### <a name="marginadjustments"></a>Plot Adjustments
|
|
122 |
|
93 | 123 |
- Filter out reads with a mean quality less than 5, and a length
|
94 | 124 |
less than 800. Zoom in to plot only mean quality of at least 4 and
|
95 | 125 |
read length at least 500bp.
|
96 | 126 |
- `pauvre marginplot -f miniDSMN15.fastq --filt_minqual 5 --filt_minlen 800 -y --plot_minlen 500 --plot_minqual 4`
|
97 | 127 |
- 
|
98 | 128 |
|
99 | |
### Specialized Options
|
|
129 |
#### <a name="marginspecialized"></a>Specialized Options
|
100 | 130 |
|
101 | 131 |
- Plot ONT 1D data with a large tail
|
102 | 132 |
- `pauvre marginplot --maxlen 100000 --maxqual 15 --lengthbin 500 <myfile>.fastq`
|
103 | 133 |
- Get more resolution on lengths
|
104 | 134 |
- `pauvre marginplot --maxlen 100000 --lengthbin 5 <myfile>.fastq`
|
105 | 135 |
|
106 | |
### Transparency
|
107 | |
|
108 | 136 |
- Turn off transparency if you just want a white background
|
109 | 137 |
- `pauvre marginplot --transparent False <myfile>.fastq`
|
110 | 138 |
- Note: transparency is the default behavior
|
111 | 139 |
- 
|
112 | 140 |
|
113 | |
# Contributors
|
|
141 |
## <a name="contributors"></a>Contributors
|
114 | 142 |
|
115 | 143 |
@conchoecia (Darrin Schultz)
|
116 | 144 |
@mebbert (Mark Ebbert)
|