-
Notifications
You must be signed in to change notification settings - Fork 32
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #196 from kyrabowden/main
HW 1 Modifications for EC HW 3
- Loading branch information
Showing
2 changed files
with
52 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,52 @@ | ||
--- | ||
layout: post | ||
title: "Correlation of Transcription Factor Expression with Cell and Nucleus Area" | ||
author: Kyra Bowden | ||
jhed: kbowden5 | ||
categories: [ HW1 ] | ||
image: homework/hw1/hw1_kbowden5_EC.png | ||
featured: false | ||
--- | ||
|
||
### What data types are you visualizing? | ||
I am visualizing the quantitative data of the expression count of the FOXA1 gene for each cell, quantitative data of the expression count of GATA5 gene for each cell, and the quantitative data of the cell and nucleus areas. | ||
|
||
### What data encodings (geometric primitives and visual channels) are you using to visualize these data types? | ||
I used the geometric primitive of points. Each point represents a single cell. I used the visual channels of color hue, x-position, and y-position. In the first panel, the color of each point represents the cell area, with the brighter hues indicating larger cells. In the second panel, the color hue represents the nucleus area with light green and yellow points indicating the largest nuclei. I split up the cell and nucleus area so that you can easily see the relationship each has with transcription factor expression and compare these relationships. The x-position of each point is representative of the expression count of the GATA3 gene for that cell and the y-position represents the expression count of the FOXA1 gene for that cell. | ||
|
||
### What about the data are you trying to make salient through this data visualization? | ||
The FOXA1 and GATA3 genes are both known transcription factors. This data visualization makes salient the relationship between transcription factor expression, cell area, and nucleus area. Most notably, the cell with the greatest nuclues area has high levels of expression counts of both the transcription factors while the largest cell areas have the lowest expression counts of both the transcription factors. | ||
|
||
### What Gestalt principles or knowledge about perceptiveness of visual encodings are you using to accomplish this? | ||
The main Gestalt principle used in this data visualization is similarity in color. The cells with smaller nucleus areas are all darker blues while the several with significantly larger nuclei are light green or yellow. This use of similarity in color highlights the increased nucleus area with the icnreased transcription factor expression counts. Additionally, this blue to yellow color palette was used knowing that some people are color blind and cannot distinguish between red and green or blue and orange. | ||
|
||
```{r} | ||
data <- read.csv('/Users/Kyra_1/Documents/College/Junior/Spring/Genomic_Data_Visualization/Homework_1/pikachu.csv.gz', row.names = 1) | ||
dim(data) | ||
data[1:10, 1:10] | ||
colnames(data) | ||
install.packages("viridis") | ||
library(ggplot2) | ||
library(viridis) | ||
library(patchwork) | ||
p1 = ggplot(data)+ | ||
geom_point(aes(x=GATA3, y = FOXA1, col = cell_area, size = 0.4))+ | ||
scale_color_viridis_c(option = 'C')+ | ||
theme_minimal()+ | ||
labs(title = "FOXA1 and GATA3 Impact on Cell Area", | ||
x = "GATA3", | ||
y = "FOXA1") | ||
p2 = ggplot(data)+ | ||
geom_point(aes(x=GATA3, y = FOXA1, col = nucleus_area, size = 0.4))+ | ||
scale_color_viridis_c(option = 'C')+ | ||
theme_minimal()+ | ||
labs(title = "FOXA1 and GATA3 Impact on Nucleus Area", | ||
x = "GATA3", | ||
y = "FOXA1") | ||
## combine the plots | ||
p1+p2 | ||
``` |
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.