-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathindex.xml
272 lines (203 loc) · 22.2 KB
/
index.xml
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
<?xml version="1.0" encoding="utf-8" standalone="yes" ?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
<channel>
<title>Alexander Lab @ WHOI on Alexander Lab @ WHOI</title>
<link>https://alexanderlabwhoi.github.io/</link>
<description>Recent content in Alexander Lab @ WHOI on Alexander Lab @ WHOI</description>
<generator>Source Themes Academic (https://sourcethemes.com/academic/)</generator>
<language>en-us</language>
<copyright>&copy; 2019</copyright>
<lastBuildDate>Mon, 01 Jan 2024 00:00:00 -0500</lastBuildDate>
<atom:link href="/" rel="self" type="application/rss+xml" />
<item>
<title>Microeukaryote metabolism across the western North Atlantic Ocean revealed through autonomous underwater profiling</title>
<link>https://alexanderlabwhoi.github.io/publication/cohen-2024-microeukaryote/</link>
<pubDate>Mon, 01 Jan 2024 00:00:00 -0500</pubDate>
<guid>https://alexanderlabwhoi.github.io/publication/cohen-2024-microeukaryote/</guid>
<description></description>
</item>
<item>
<title>Molecular forecasting of toxic bloom events</title>
<link>https://alexanderlabwhoi.github.io/publication/alexander-2024-molecular/</link>
<pubDate>Mon, 01 Jan 2024 00:00:00 -0500</pubDate>
<guid>https://alexanderlabwhoi.github.io/publication/alexander-2024-molecular/</guid>
<description></description>
</item>
<item>
<title>The Dawn of the BioGeoSCAPES Program</title>
<link>https://alexanderlabwhoi.github.io/publication/saito-2024-dawn/</link>
<pubDate>Mon, 01 Jan 2024 00:00:00 -0500</pubDate>
<guid>https://alexanderlabwhoi.github.io/publication/saito-2024-dawn/</guid>
<description></description>
</item>
<item>
<title>sourmash v4: A multitool to quickly search, compare, and analyze genomic and metagenomic data sets</title>
<link>https://alexanderlabwhoi.github.io/publication/irber-2024-sourmash/</link>
<pubDate>Mon, 01 Jan 2024 00:00:00 -0500</pubDate>
<guid>https://alexanderlabwhoi.github.io/publication/irber-2024-sourmash/</guid>
<description></description>
</item>
<item>
<title>Reverse engineering environmental metatranscriptomes clarifies best practices for eukaryotic assembly</title>
<link>https://alexanderlabwhoi.github.io/publication/krinos-2023/</link>
<pubDate>Wed, 01 Mar 2023 00:00:00 -0500</pubDate>
<guid>https://alexanderlabwhoi.github.io/publication/krinos-2023/</guid>
<description></description>
</item>
<item>
<title>Using anvi'o in interactive mode</title>
<link>https://alexanderlabwhoi.github.io/post/2022-10-24_anvio/</link>
<pubDate>Mon, 24 Oct 2022 15:25:12 -0400</pubDate>
<guid>https://alexanderlabwhoi.github.io/post/2022-10-24_anvio/</guid>
<description>
<p>Anvi&rsquo;o is a powerful tool for analyzing and vizualizing &lsquo;omics data, much of which relies on one&rsquo;s ability to visualize those data. On Poseidon, it can sometimes be difficult to work with these kinds of plotting tools, because we&rsquo;re working on the command line. To use anvi&rsquo;o interactively on Poseidon, we need to go through a few extra steps.</p>
<p>I&rsquo;ll walk you through the whole installation of anvi&rsquo;o here from start to finish! Most of these instructions are also available on anvio.org. The other half of these instructions are mostly in the posts <a href="https://alexanderlabwhoi.github.io/post/2019-03-08_jpn_slurm/" target="_blank">here</a> (for Mac) and <a href="https://alexanderlabwhoi.github.io/post/2019-07-24-slurm-win/" target="_blank">here</a> (for Windows) that we&rsquo;ve previously uploaded to the blog!</p>
<h3 id="step-one-create-a-fresh-conda-environment-for-anvi-o-and-install-dependencies">Step One: create a fresh <code>conda</code> environment for anvi&rsquo;o and install dependencies</h3>
<p>I like using <code>mamba</code>. That will make the rest of your installs much faster. So I&rsquo;ve modified anvi&rsquo;o&rsquo;s installation instructions a little bit to fit that principle. After logging into Poseidon&rsquo;s login node (and making sure that you have the most recent and/or your preferred anaconda module loaded), we&rsquo;ll run:</p>
<pre><code>conda create -y --name anvio python=3.6 mamba
</code></pre>
<p>We&rsquo;ll wait for that to finish, then execute <code>conda activate anvio</code> (or whatever you chose to name your environment! Next, we have a whole list of packages we&rsquo;ll want mamba to go ahead and (efficiently) install for us in order for anvi&rsquo;o to work. You should be able to just copy, paste, and execute this entire code block right in your terminal window.</p>
<pre><code>mamba install -y -c bioconda &quot;sqlite&gt;=3.31.1&quot;
mamba install -y -c bioconda prodigal
mamba install -y -c bioconda mcl
mamba install -y -c bioconda muscle=3.8.1551
mamba install -y -c bioconda hmmer
mamba install -y -c bioconda diamond
mamba install -y -c bioconda blast
mamba install -y -c bioconda megahit
mamba install -y -c bioconda spades
mamba install -y -c bioconda bowtie2 tbb=2019.8
mamba install -y -c bioconda bwa
mamba install -y -c bioconda samtools=1.9
mamba install -y -c bioconda centrifuge
mamba install -y -c bioconda trimal
mamba install -y -c bioconda iqtree
mamba install -y -c bioconda trnascan-se
mamba install -y -c bioconda r-base
mamba install -y -c bioconda r-stringi
mamba install -y -c bioconda r-tidyverse
mamba install -y -c bioconda r-magrittr
mamba install -y -c bioconda r-optparse
mamba install -y -c bioconda bioconductor-qvalue
mamba install -y -c bioconda fasttree
mamba install -y -c bioconda vmatch
mamba install -y -c bioconda fastani ## anvio says you don't need this one
</code></pre>
<p>Note: anvi&rsquo;o&rsquo;s website says that you don&rsquo;t need the <code>fastani</code> package installed for the software to work. The install went fine for me. The whole install process might take quite a while, depending on the speed of the connection. If you elect to use <code>conda</code> over <code>mamba</code>, it will take a <em>lot</em> longer.</p>
<h3 id="step-two-actually-install-anvi-o">Step Two: actually install anvi&rsquo;o</h3>
<p>You should execute this in a folder where you don&rsquo;t mind software files being downloaded to:</p>
<pre><code>curl -L https://github.com/merenlab/anvio/releases/download/v7.1/anvio-7.1.tar.gz \
--output anvio-7.1.tar.gz
pip install anvio-7.1.tar.gz
</code></pre>
<p>Now, you should be good to go to run anvi&rsquo;o!</p>
<h3 id="step-three-get-a-worker-node-spun-up-to-do-your-bidding">Step Three: get a worker node spun up to do your bidding</h3>
<p>We&rsquo;ll run an interactive job on Poseidon using <code>srun</code>. You can modify this command as needed if you want this job to go for longer than two hours&hellip;or if people are hogging all the <code>compute</code> nodes and you need to hedge your bets with a pre-empt-able <code>scavenger</code> run, for example.</p>
<pre><code>srun -p compute --time=02:00:00 --mem=50gb --pty bash
</code></pre>
<p>Eventually, this should log you into a clean worker node, so your prompt should change from <username>@poseidon-l<1 or 2>.whoi.edu to <username>@pn&lt;# of node you were assigned&gt;. Now, you&rsquo;re ready to not annoy the IT department by executing expensive programs on the common login node! Make sure now to <code>conda activate anvio</code>.</p>
<h3 id="step-four-run-anvi-o-interactive-but-make-sure-to-specify-a-port">Step Four: run anvi&rsquo;o interactive, but make sure to specify a port</h3>
<p>In order to run this, I am going to use the example the <a href="https://merenlab.org/2016/02/27/the-anvio-interactive-interface/" target="_blank">Eren lab</a> provides, since I don&rsquo;t use anvi&rsquo;o much. Note that their link is broken on their tutorial website. I found one that works <a href="https://github.com/meren/anvio/blob/master/anvio/tests/sandbox/example-newick-tree.txt" target="_blank">here</a>. I just saved the Newick text at that link to a file called <code>meren_tree.txt</code>, then (inside my interactive session!), I executed:</p>
<pre><code>anvi-interactive -t meren_tree.txt -p profile.db --title 'Simple tree test' --server-only -P 5785
</code></pre>
<h3 id="step-five-log-into-your-node-and-forward-that-port-to-your-browser">Step Five: log into your node and forward that port to your browser</h3>
<p>All of these steps are straight out of our tutorial on running a Jupyter notebook remotely that I linked above. If you use Windows, be sure to check out that post - I&rsquo;m only using the Mac command here.</p>
<pre><code>ssh -t -t &lt;username&gt;@poseidon-l1.whoi.edu -L 5785:localhost:5785 ssh &lt;the name of the &quot;pn&quot; node from above that showed up when you ran srun&gt; -L 5785:localhost:5785
</code></pre>
<p>To summarize, above you need to swap out:
- your own username
- the port number if you changed it
- the name of the worker node you ssh&rsquo;d into</p>
<p>For example, if my username is &ldquo;akrinos&rdquo;, I used port 5786, and I ssh&rsquo;d into &ldquo;pn075&rdquo;, I would type:</p>
<pre><code>ssh -t -t [email protected] -L 5786:localhost:5786 ssh pn075 -L 5786:localhost:5786
</code></pre>
<p>Be sure to check out our more detailed post on ssh tunneling if you want shortcuts for doing this!</p>
<h3 id="step-six-access-your-interactive-session">Step Six: access your interactive session</h3>
<p>Simply type <code>localhost:5786</code> into your web browser (replacing this second number with your present node), and you should be able to see your anvi&rsquo;o interactive session! Feel free to use any <code>anvi-interactive</code> command and accessory files that you want to after creating/uploading those files on Poseidon.</p>
<p>Happy interacting!s</p>
</description>
</item>
<item>
<title>Bio-GO-SHIP: The Time Is Right to Establish Global Repeat Sections of Ocean Biology</title>
<link>https://alexanderlabwhoi.github.io/publication/clayton-2022-bio/</link>
<pubDate>Sat, 01 Jan 2022 00:00:00 -0500</pubDate>
<guid>https://alexanderlabwhoi.github.io/publication/clayton-2022-bio/</guid>
<description></description>
</item>
<item>
<title>Marine microeukaryote metatranscriptomics: sample processing and bioinformatic workflow recommendations for ecological applications</title>
<link>https://alexanderlabwhoi.github.io/publication/cohen-2022-marine/</link>
<pubDate>Sat, 01 Jan 2022 00:00:00 -0500</pubDate>
<guid>https://alexanderlabwhoi.github.io/publication/cohen-2022-marine/</guid>
<description></description>
</item>
<item>
<title>Microbiomes of bloom-forming Phaeocystis algae are stable and consistently recruited, with both symbiotic and opportunistic modes</title>
<link>https://alexanderlabwhoi.github.io/publication/mars-2022-microbiomes/</link>
<pubDate>Sat, 01 Jan 2022 00:00:00 -0500</pubDate>
<guid>https://alexanderlabwhoi.github.io/publication/mars-2022-microbiomes/</guid>
<description></description>
</item>
<item>
<title>Bio-GO-SHIP: Linking marine biodiversity and biogeochemistry</title>
<link>https://alexanderlabwhoi.github.io/project/biogoship/</link>
<pubDate>Fri, 02 Jul 2021 15:15:49 -0400</pubDate>
<guid>https://alexanderlabwhoi.github.io/project/biogoship/</guid>
<description><p>The <a href="http://biogoship.org/" target="_blank">Bio-GO-SHIP</a> program aims to integrate the latest biological observing technologies into the Global Ocean Ship-based Hydrographic Investigation Program (GO-SHIP, <a href="http://go-ship.org/" target="_blank">http://go-ship.org/</a>). GO-SHIP runs a global repeat-section effort, wherein reference sections that transect major ocean regions are sampled every decade to observe global changes. Bio-GO-SHIP measurements will include optical, flow cytometric, metagenomic and metatranscriptomic, and particulate sampling. Compared to previous global ocean sequencing efforts, Bio-GO-SHIP will sample biological parameters at high spatial and vertical resolution, with ‘omic samples being taken approximately every degree, or approximately 26.5km on average between samples. My group is focused on the collection, sequencing, and preliminary assembly data product curation of the ‘omic dataset in conjunction with Adam Martiny’s lab&ndash; with my lab taking the lead on the eukaryotic component.</p>
<p><em>Funding for this project provided by NOAA and the Simons Foundation</em></p>
</description>
</item>
<item>
<title>EUKulele: Taxonomic annotation</title>
<link>https://alexanderlabwhoi.github.io/project/eukulele/</link>
<pubDate>Fri, 02 Jul 2021 15:15:49 -0400</pubDate>
<guid>https://alexanderlabwhoi.github.io/project/eukulele/</guid>
<description><p>Easy, accurate classification of the taxonomy of microbial eukaryotes in the environment is a persistent challenge. For metagenomic and metatranscriptomic data, annotation is imperfect, because genomic references for organisms isolated from the environment are lacking. By leveraging experimental transcriptomic references, we have designed a tool to estimate the taxonomy of marine microbial eukaryotes. The tool is a flexible and modular way to leverage the expanding amount of omics data on environmentally-isolated microbial eukaryotes in order to better annotate multi-organism omics datasets (meta-omics). The freely-available tool is implemented as a Python package and installable via both <code>pip</code> and <code>conda</code>.</p>
<p><strong>Project Repository</strong>: <a href="https://github.com/AlexanderLabWHOI/eukulele" target="_blank">https://github.com/AlexanderLabWHOI/eukulele</a></p>
</description>
</item>
<item>
<title>Ecological Role of Eukaryotic Pangenomes</title>
<link>https://alexanderlabwhoi.github.io/project/ehuxleyi/</link>
<pubDate>Fri, 02 Jul 2021 15:15:49 -0400</pubDate>
<guid>https://alexanderlabwhoi.github.io/project/ehuxleyi/</guid>
<description><p>Coccolithophores, a group of calcifying marine phytoplankton within the phylum Haptophyta, play a significant role in marine biogeochemical cycles, particularly in those of carbon and sulfur. <em>Emiliania huxleyi</em> is one of the most abundant extant coccolithophore species in the ocean, typified by its cosmopolitan distribution and ability to form large blooms. Intra-specific variability in physiology (e.g. enzymatic rates, morphology, growth rates) of <em>E. huxleyi</em> has long been known in cultured isolates. The capacity for this variability was revealed through the genome sequencing of many <em>E. huxleyi</em> strains, the results of which suggested that <em>E. huxleyi</em> has a ‘pan genome,’ with gene content varying across strains. Intensive study of another ecologically significant taxa with a pan genome, the cyanobacteria <em>Prochlorococcus</em>, has connected the variability of the pan genome to its niche separation and biogeographical distribution. Similar to <em>Prochlorococcus</em>, intra-specific genomic variability may underpin the success of <em>E. huxleyi</em> across diverse environmental conditions and underlie its observed physiological variability. Here we are combining a series of genomic, computational field surveys, and laboratory-based experiments designed to identify the function of the pan genome in <em>E. huxleyi</em> ecology and biogeography. Through this project, we will address the questions:</p>
<ol>
<li>Is there structure to the variable gene content across strains in the pan genome of <em>E. huxleyi</em>?</li>
<li>Does the environment select for particular strains of <em>E. huxleyi</em> based on the content of the variable portion of the pan genome?</li>
<li>Does the composition of the variable gene set determine strain success under shifting environmental conditions?</li>
</ol>
<p><em>Funding for this project provided by the National Science Foundation</em></p>
</description>
</item>
<item>
<title>EukHeist: Recovering Eukaryotic MAGs from Metagenomes</title>
<link>https://alexanderlabwhoi.github.io/project/eukheist/</link>
<pubDate>Fri, 02 Jul 2021 15:15:49 -0400</pubDate>
<guid>https://alexanderlabwhoi.github.io/project/eukheist/</guid>
<description><p>Eukaryotic microbes, or protists, are key drivers of marine ecosystems, with roles ranging from primary producers (phytoplankton) to consumers (heterotrophs or mixotrophs). As with their bacteria and archaea, many protists are difficult or impossible to culture. This has limited our ability to directly interrogate their biology in the laboratory and has led us to underestimate their diversity across marine ecosystems. Molecular and genomic approaches, particularly those applied to whole, mixed communities (e.g. metagenomics, metatranscriptomics), have shed light on the ecological roles, evolutionary histories, and physiological capabilities of these organisms.</p>
<p>EukHeist presents a scalable and reproducible bioinformatic pipeline to facilitate the retrieval of eukaryotic metagenome assembled genomes (MAGs) from mixed metagenomes, called EUKHeist (<a href="https://github.com/AlexanderLabWHOI/EUKHeist" target="_blank">https://github.com/AlexanderLabWHOI/EUKHeist</a>). EUKHeist streamlines and automates the discovery, recovery, quality assessment, and analysis of eukaryotic metagenome assembled genomes (MAGs) from mixed community metagenomes. EukHeist incorporates taxonoßßmic annotation through EUKulele (<a href="https://github.com/AlexanderLabWHOI/EUKulele" target="_blank">https://github.com/AlexanderLabWHOI/EUKulele</a>) as well as scalable structural and functional annotation of coding regions in eukaryotic MAGs with EukMetaSanity (<a href="https://github.com/cjneely10/EukMetaSanity" target="_blank">https://github.com/cjneely10/EukMetaSanity</a>).</p>
<p><strong>Project repository:</strong> <a href="https://github.com/AlexanderLabWHOI/EUKHeist" target="_blank">https://github.com/AlexanderLabWHOI/EUKHeist</a></p>
<p><em>Funding for this project provided by the Simons Foundation and WHOI</em></p>
</description>
</item>
<item>
<title>Phaeocystis</title>
<link>https://alexanderlabwhoi.github.io/project/phaeocystis/</link>
<pubDate>Fri, 02 Jul 2021 15:15:49 -0400</pubDate>
<guid>https://alexanderlabwhoi.github.io/project/phaeocystis/</guid>
<description><p>Phytoplankton blooms are central to the ecology of Cape Cod Bay, where historically spring blooms of diatoms have been rapidly grazed, shuttling biomass through the food web. In more recent years, blooms of a haptophyte, <em>Phaeocystis pouchetii</em> have become more frequent in Cape Cod Bay. These organisms produce large gelatinous colonies, which prevents blooms of this haptophyte from being effectively grazed, hence reducing carbon cycling in the upper water column. We are partnering with the Center for Coastal Studies in Provincetown to find and study these blooms.</p>
<p><em>Funding from the Simons Foundation</em></p>
</description>
</item>
<item>
<title>SeqWell for Marine Microbes</title>
<link>https://alexanderlabwhoi.github.io/project/seqwell/</link>
<pubDate>Fri, 02 Jul 2021 15:15:49 -0400</pubDate>
<guid>https://alexanderlabwhoi.github.io/project/seqwell/</guid>
<description><p>A central project aim is adaptation of a new single cell transcriptome (SCT) sequencing approach called <a href="https://shaleklab.com/publication/seq-well-portable-low-cost-rna-sequencing-of-single-cells-at-high-throughput/" target="_blank">SeqWell</a> for analysis of natural microbial communities. The method addresses two overarching obstacles to microbial symbiosis investigations: 1) the requirement that symbionts be brought into culture for ‘omics analyses, and 2) time resolution limits on bulk approaches because populations are asynchronous. With enough SCTs, pseudotime methods can establish the time course of host and symbiont interactions. SeqWell is a straightforward settling-chamber based system that can capture &gt;10,000 SCTs in a single experiment. Compared to competing methods, it is also low-cost and readily adaptable for a broad range of protist shapes and sizes. We aim to address three major challenges inherent to scRNAseq of protist communities in the environment: 1) the morphological diversity (size/shapes) of protists across taxonomic groups; 2) cost; and 3) a need for bioinformatic pipelines that can analyze SCTs from natural microbial communities.</p>
<p><em>Funding for this project provided by the Moore Foundation</em></p>
</description>
</item>
</channel>
</rss>