Skip to content

Commit c2f5102

Browse files
committed
update documentation
1 parent a200ef4 commit c2f5102

4 files changed

Lines changed: 12 additions & 6 deletions

File tree

config/README.md

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,8 @@ For each sample, add one or more sequencing units (runs, lanes or replicates) to
2727
* `fq1` and `fq2` for paired end reads. These can point to any FASTQ files on your system
2828
* `sra` only: specify an SRA (sequence read archive) accession (starting with e.g. ERR or SRR). The pipeline will automatically download the corresponding paired end reads from SRA.
2929
* If both local files (`fq1`, `fq2`) and SRA accession (`sra`) are available, the local files will be used.
30-
* Define adapters in the `adapters` column, by putting [cutadapt arguments](https://cutadapt.readthedocs.org) in quotation marks (e.g. `"-a ACGCGATCG -A GCTAGCGTACT"`).
30+
* Define adapters in the `adapters` column, by putting [fastp arguments](https://github.com/OpenGene/fastp?tab=readme-ov-file#adapters) in quotation marks (e.g. `"--adapter_sequence ACGCGATCG --adapter_sequence_r2 GCTAGCGTACT"`).
31+
Automatic adapter trimming can be enabled by setting the keyword `auto_trim` (Please consider the fastp documentation for paired end auto trimming). If the column is empty no trimming will be performed.
3132

3233
Missing values can be specified by empty columns or by writing `NA`. Lines can be commented out with `#`.
3334

@@ -57,7 +58,5 @@ For annotating UMIs two additional columns in `sample.tsv` must be set:
5758
* `fq1` if the UMIs are part of read 1
5859
* `fq2` if the UMIs are part of read 2
5960
* `both` if there are UMIs in both paired end reads
60-
* the path to an additional fastq file containing just the UMI of each fragment in fq1 and fq2 (with the same read names)
61-
* `umi_read_structure`: A read structure defining the UMI position in each UMI record (see https://github.com/fulcrumgenomics/fgbio/wiki/Read-Structures). If `both` reads contain a UMI, specify a read structure for both with whitespace in between (for example, `8M142T 8M142T`). In case a separate fastq file only containg UMI sequences is set the read structure needs to be `+M`.
62-
Read names of UMI records must match the corresponding records of the sample fastqs.
61+
* `umi_len`: Number of bases (UMI length) to be annotated as UMI.
6362

config/samples.tsv

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,2 @@
1-
sample_name alias group platform purity panel umi_read umi_read_structure datatype calling
1+
sample_name alias group platform purity panel umi_read umi_len datatype calling
22
SRR702070 tumor SRR702070_group ILLUMINA 1.0 dna variants

workflow/rules/common.smk

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -371,7 +371,12 @@ def get_fastp_adapters(wildcards):
371371
try:
372372
adapters = unit["adapters"]
373373
if isinstance(adapters, str):
374-
return adapters
374+
# Autotrimming is enabled by default.
375+
# Therefore no adapter parameter needs to be passed.
376+
if adapters == "auto_trim":
377+
return ""
378+
else:
379+
return adapters
375380
return ""
376381
except KeyError:
377382
return ""

workflow/scripts/split-call-tables.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -173,6 +173,7 @@ def variants(
173173
self.pos = pos
174174
self._variants = self._load_variants()
175175
for variant in self._variants:
176+
print(variant.pos)
176177
if variant.pos == pos and variant.alts[0] == alt:
177178
yield variant
178179
if variant.pos > pos:
@@ -195,6 +196,7 @@ def annotate_row(self, row):
195196
)
196197

197198
def _load_variants(self):
199+
print(self.pos)
198200
return self.bcf.fetch(str(self.contig), self.pos - 1, self.end)
199201

200202
@property

0 commit comments

Comments
 (0)