CBIcall execution requires:
A folder containing paired-end FASTQ files, for example:
CNAG999_exome/CNAG99901P_ex//*{R1,R2}*fastq.gz
Example input data is available under examples/input/.
CBIcall is configured through a YAML parameters file.
Below are the main parameters and their default values.
mode: single
pipeline: wes
input_dir: null
sample_map: null
workflow_engine: bash
gatk_version: gatk-4.6
genome: b37
cleanup_bam: false
workflow_rule: null
allow_partial_run: false
project_dir: cbicallorganism: Homo sapiens
technology: Illumina HiSeqCBIcall creates a dedicated project directory to store analysis outputs. Multiple runs can coexist without modifying the original inputs.
Set to true to delete 01_bam/*.{bam,bai} after the workflow completes.
Supported values:
gatk-3.5gatk-4.6
Supported values:
b37(default for non-mtDNA runs)hg38rsrs(forced forpipeline: mit)
Supported values:
singlecohort
Supported values:
weswgsmit
For cohort analysis, complete the corresponding single analysis for each sample first.
Prefix used for the generated run directory, for example cancer_sample_001.
It can also contain a path such as foo/cancer_sample_001.
When input_dir is defined, the final run directory is created below input_dir, with a unique run identifier appended automatically.
Path, relative or absolute, to the directory containing FASTQ inputs for analysis.
Example:
examples/input/CNAG999_exome/CNAG99901P_ex
Cohort-mode parameter pointing to the TSV file containing sample identifiers and GVCF paths.
Example:
Supported workflow engines:
bashsnakemake
Optional workflow target used to start a partial run. Leave it unset for a normal full workflow execution.
Partial runs are currently supported for the snakemake workflow engine.
Safety switch for partial runs. Set it to true together with workflow_rule to explicitly allow a partial workflow start.
If workflow_rule is set but allow_partial_run is not true, CBIcall refuses to start the run.
cbicall -p <parameters_file.yaml> -t <n_threads> [options]
Arguments:
-p,--param: Parameters input file (YAML)-t,--threads: Number of CPUs/Cores/Threads
Options:
-debug,--debug: Debugging level-h,--help: Brief help message-man: Full documentation-v,--version: Show version information-verbose: Enable verbose output-nc,--no-color: Disable ANSI colors in STDOUT
bin/cbicall -p param_file.yaml -t 8
bin/cbicall -p param_file.yaml -t 4 -verbose
bin/cbicall -p param_file.yaml -t 16 > log 2>&1
python3 -u ../../bin/cbicall -p param.yaml -t 8 --no-color > log
$path_to_cbicall/bin/cbicall -p param_file.yaml -t 8 -debug 5
nohup bin/cbicall -p param_file.yaml -t 4 &Partial runs are intended for workflow-layer restarts or targeted execution, without changing the output directory naming scheme.
Example:
workflow_engine: snakemake
workflow_rule: call_variants
allow_partial_run: trueBehavior:
- if
workflow_ruleis unset, the full workflow runs - if
workflow_ruleis set, the run is marked as partial in metadata and CLI output - if
workflow_ruleis set withoutallow_partial_run: true, the run is rejected
Each run creates a project directory containing:
- the workflow log file
log.jsonwith CLI arguments, resolved configuration, and YAML parameters- workflow-generated outputs
The project directory name is not changed when workflow_rule is used.
CBIcall is optimized for multi-core Linux desktop, workstation, or server environments.
- Works on amd64 and arm64 architectures, including Apple Silicon
- Debian-based distributions are preferred, though others may work
- At least 8 GB RAM
- At least 4 CPU cores
- At least 250 GB disk space
These are required only when installing CBIcall from source:
- Python 3.8 or newer
- Java 8
- Snakemake