Skip to content

databio/bedbase-loader

Repository files navigation

BEDbase loader

This repository contains GitHub Actions workflows for automated BED file uploads to BEDbase.

For complete documentation on the data loading architecture, see: BEDbase Data Loading

Accbase Loading

Accbase workflows queue ATAC-seq samples for downstream processing on Rivanna HPC.

Workflow Schedule Purpose
accbase_queue_cron.yml Daily 19:00 UTC Queue new samples from accbase namespace
accbase_queue_gse.yml Manual Queue samples from a specific GSE

Unlike BEDbase (which processes files in GitHub Actions + Fargate), Accbase only queues samples in the sample_queue table. Heavy processing (PEPATAC pipeline) runs on Rivanna HPC.

Configuration: accbase_config.yaml

Required Secrets

For Accbase workflows:

  • ACCBASE_POSTGRES_HOST - PostgreSQL host for accbase
  • ACCBASE_POSTGRES_USER - Database user
  • ACCBASE_POSTGRES_PASSWORD - Database password
  • PEPHUB_API_KEY - Optional, for higher rate limits

Queue Consumer

The sample_queue table is consumed on Rivanna:

  1. Consumer queries for samples with status='pending'
  2. Generates looper submission from PEPhub PEP
  3. Submits SLURM jobs via looper
  4. Updates status to 'processing'
  5. On completion, updates status to 'completed'

This decouples data discovery (GitHub Actions) from heavy processing (Rivanna HPC).

About

BEDbase GEO bed file uploading

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages