This function loads and processes contig data from various single-cell
immune receptor sequencing formats. It reads data from a directory
(recursively) or from an already loaded list/data frame, transforms it to a
common structure, and returns a list of contigs ready for downstream analysis
with combineTCR() or combineBCR().
Supported file formats and their expected file names:
10X: "filtered_contig_annotations.csv"AIRR: "airr_rearrangement.tsv"BD: "Contigs_AIRR.tsv"Dandelion: "all_contig_dandelion.tsv"Immcantation: "_data.tsv" (or similar)“JSON`: ".json"
ParseBio: "barcode_report.tsv"MiXCR: "clones.tsv"TRUST4: "barcode_report.tsv"WAT3R: "barcode_results.csv"
Arguments
- input
A directory path containing contig files or a list/data frame of pre-loaded contig data.
- format
A string specifying the data format. Must be one of:
auto,10X,AIRR,BD,Dandelion,JSON,MiXCR,ParseBio,TRUST4,WAT3R, orImmcantation. If "auto", the function attempts automatic format detection.
Value
A list of contigs formatted for use with combineTCR() or
combineBCR(). Rows containing only NA values (aside from the barcode)
are dropped.
Examples
TRUST4 <- read.csv("https://www.borch.dev/uploads/contigs/TRUST4_contigs.csv")
contig.list <- loadContigs(TRUST4, format = "TRUST4")
BD <- read.csv("https://www.borch.dev/uploads/contigs/BD_contigs.csv")
contig.list <- loadContigs(BD, format = "BD")
WAT3R <- read.csv("https://www.borch.dev/uploads/contigs/WAT3R_contigs.csv")
contig.list <- loadContigs(WAT3R, format = "WAT3R")