Utility
The original intent of assembling a data set of publicly-available tumor-infiltrating T cells (TILs) with paired TCR sequencing was to expand and improve the scRepertoire R package. However, after some discussion, we decided to release the data set for everyone, a complete summary of the sequencing runs and the sample information can be found in the meta data of the Seurat object.
Folder Structure
├── code
│ ├── Processing_Utility.Rmd - general processing script
│ └── Summarize_Data.Rmd - script to get summary data
├── data
│ ├── SequencingRuns - 10x Outputs
│ └── processedData - Processed .rds and larger combined cohorts
├── NEWS.txt - changes made
├── outputs
│ └── qc - plots for quality control purposes
├── README.md
└── summaryInfo
├── TcellSummaryTable.csv
├── cohortSummaryTable.csv
├── meta.data.headers.txt - what the meta data headers mean
├── sample.directory.xlsx - all the available data for the cohort
├── sessionInfo.txt - what I am running in terms of the pipeline
└── tumorSummaryTable.csv
Sample ID:
Cohort Information
Here is the current list of data sources, the number of cells that passed filtering by tissue type. Please cite the data if you are using uTILity.
| Tumor | Normal | Blood | Juxta | LN | Met | Cancer Type | Citations | |
|---|---|---|---|---|---|---|---|---|
| CCR-20-4394 | 26760 | 0 | 0 | 0 | 0 | 0 | Ovarian | cite |
| EGAS00001004809 | 181667 | 0 | 0 | 0 | 0 | 0 | Breast | cite |
| GSE114724 | 27651 | 0 | 0 | 0 | 0 | 0 | Breast | cite |
| GSE121636 | 11436 | 0 | 12319 | 0 | 0 | 0 | Renal | cite |
| GSE123814 | 78034 | 0 | 0 | 0 | 0 | 0 | Multiple | cite |
| GSE139555 | 93160 | 78625 | 25363 | 0 | 0 | 0 | Multiple | cite |
| GSE145370 | 66592 | 40916 | 0 | 0 | 0 | 0 | Esophageal | cite |
| GSE148190 | 2263 | 0 | 6201 | 0 | 15644 | 0 | Melanoma | cite |
| GSE154826 | 14491 | 13414 | 0 | 0 | 0 | 0 | Lung | cite |
| GSE159251 | 8356 | 0 | 47721 | 0 | 5705 | 0 | Melanoma | cite |
| GSE162500 | 14644 | 0 | 23401 | 3761 | 0 | 0 | Lung | cite |
| GSE164522 | 36990 | 86811 | 46027 | 0 | 46376 | 36648 | Colorectal | cite |
| GSE168844 | 0 | 0 | 55302 | 0 | 0 | 0 | Lung | cite |
| GSE176021 | 436609 | 128411 | 132673 | 0 | 71063 | 32011 | Lung | cite |
| GSE179994 | 78574 | 0 | 0 | 0 | 0 | 62341 | Lung | cite |
| GSE180268 | 23215 | 0 | 0 | 0 | 29699 | 0 | HNSCC | cite |
| GSE181061 | 40429 | 27622 | 37426 | 0 | 0 | 0 | Renal | cite |
| GSE185206 | 163294 | 17231 | 0 | 0 | 9820 | 0 | Lung | cite |
| GSE195486 | 122512 | 0 | 0 | 0 | 0 | 0 | Ovarian | cite |
| GSE200218 | 0 | 0 | 0 | 0 | 0 | 18495 | Melanoma | cite |
| GSE200996 | 86235 | 0 | 152722 | 0 | 0 | 0 | HNSCC | cite |
| GSE201425 | 22888 | 0 | 27781 | 0 | 11350 | 12253 | Biliary | cite |
| GSE211504 | 0 | 0 | 33685 | 0 | 0 | 0 | Melanoma | cite |
| GSE212217 | 0 | 0 | 229505 | 0 | 0 | 0 | Endometrial | cite |
| GSE213243 | 2835 | 0 | 18363 | 0 | 0 | 2693 | Ovarian | cite |
| GSE215219 | 26303 | 0 | 66000 | 0 | 0 | 0 | Lung | cite |
| GSE227708 | 53087 | 0 | 0 | 0 | 0 | 0 | Merkel Cell | cite |
| GSE242477 | 41595 | 0 | 21595 | 0 | 0 | 0 | Melanoma | cite |
| PRJNA705464 | 98892 | 15113 | 30340 | 0 | 3505 | 0 | Renal | cite |