Skip to content

Commit

Permalink
comments
Browse files Browse the repository at this point in the history
  • Loading branch information
andrewGhazi committed Nov 11, 2024
1 parent 6253ed6 commit 8833894
Show file tree
Hide file tree
Showing 6 changed files with 47 additions and 30 deletions.
32 changes: 15 additions & 17 deletions episodes/cell_type_annotation.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -83,17 +83,15 @@ which the cells truly reside. After annotation based on marker genes,
the clusters can be treated as proxies for more abstract biological
concepts such as cell types or states.

Popularized by its use in
[Seurat](https://cran.r-project.org/web/packages/Seurat/index.html),
graph-based clustering is a flexible and scalable technique for
clustering large scRNA-seq datasets. We first build a graph where each
node is a cell that is connected to its nearest neighbors in the
high-dimensional space. Edges are weighted based on the similarity
between the cells involved, with higher weight given to cells that are
more closely related. We then apply algorithms to identify "communities"
of cells that are more connected to cells in the same community than
they are to cells of different communities. Each community represents a
cluster that we can use for downstream interpretation.
Graph-based clustering is a flexible and scalable technique for identifying
coherent groups of cells in large scRNA-seq datasets. We first build a graph
where each node is a cell that is connected to its nearest neighbors in the
high-dimensional space. Edges are weighted based on the similarity between the
cells involved, with higher weight given to cells that are more closely related.
We then apply algorithms to identify "communities" of cells that are more
connected to cells in the same community than they are to cells of different
communities. Each community represents a cluster that we can use for downstream
interpretation.

Here, we use the `clusterCells()` function from the
[scran](https://bioconductor.org/packages/scran) package to perform
Expand Down Expand Up @@ -549,12 +547,6 @@ AUCell_exploreThresholds(cell.aucs[10:18], plotHist = TRUE, assign = TRUE)

::::

## Session Info

```{r sessionInfo}
sessionInfo()
```

## Exercises

::: challenge
Expand Down Expand Up @@ -720,3 +712,9 @@ Remember, this is an exploratory diagnostic, not the final word! At this point i
- The `r Biocpkg("AUCell")` package provides an enrichment
test to identify curated marker sets that are highly expressed in each cell.
:::

## Session Info

```{r, tidy=TRUE}
sessionInfo()
```
7 changes: 7 additions & 0 deletions episodes/eda_qc.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -808,3 +808,10 @@ Mathematically, this would require the data to fall on a two-dimensional plane (
[^2]: [Vallejos (2017)](learners/reference.md#litref)
[^3]: [Lun (2016)](learners/reference.md#litref)

## Session Info

```{r, tidy=TRUE}
sessionInfo()
```


5 changes: 5 additions & 0 deletions episodes/hca.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -347,3 +347,8 @@ You can see we don't get very many cells given the strict set of conditions we u
::::::::::::::::::::::::::::::::::::::::::::::::


## Session Info

```{r, tidy=TRUE}
sessionInfo()
```
6 changes: 6 additions & 0 deletions episodes/intro-sce.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -314,3 +314,9 @@ combined_sce

1. Pijuan-Sala B, Griffiths JA, Guibentif C et al. (2019). A single-cell molecular map of mouse gastrulation and early organogenesis. Nature 566, 7745:490-495.

## Session Info

```{r, tidy=TRUE}
sessionInfo()
```

12 changes: 6 additions & 6 deletions episodes/large_data.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -540,12 +540,6 @@ The resulting H5AD file can then be read into Python using scanpy's
[read_h5ad](https://scanpy.readthedocs.io/en/stable/generated/scanpy.read_h5ad.html)
function and then directly used in compatible Python-based analysis frameworks.

## Session Info

```{r sessionInfo}
sessionInfo()
```


## Exercises

Expand Down Expand Up @@ -646,3 +640,9 @@ system.time({i.out <- runPCA(sce.brain,
- Converter functions between existing single-cell data formats enable analysis workflows that leverage complementary functionality from poplular single-cell analysis ecosystems.

::::::::::::::::::::::::::::::::::::::::::::::::

## Session Info

```{r, tidy=TRUE}
sessionInfo()
```
15 changes: 8 additions & 7 deletions episodes/multi-sample.Rmd
Original file line number Diff line number Diff line change
Expand Up @@ -483,12 +483,6 @@ topTags(res.lfc)

Addionally, the choice of \tau can be guided by other external experimental data, like a previous or a pilot experiment.

## Session Info

```{r, tidy=TRUE}
sessionInfo()
```

## Exercises


Expand Down Expand Up @@ -585,7 +579,7 @@ If there were large shifts in the logFC estimates or p-value distributions, that

#### Extension challenge 1: Group effects

Having multiple independent samples in each experimental group is always helpful, but it particularly important when it comes to batch effect correction. Why?
Having multiple independent samples in each experimental group is always helpful, but it's particularly important when it comes to batch effect correction. Why?

::: solution

Expand Down Expand Up @@ -625,3 +619,10 @@ Imagine you had one sample that received a drug treatment and one that did not,
limited replication - except that the counts are not of reads per gene, but
of cells per label.
::::::::::::::::::::::::::::::::::::::::::::::::

## Session Info

```{r, tidy=TRUE}
sessionInfo()
```

0 comments on commit 8833894

Please sign in to comment.