-
Notifications
You must be signed in to change notification settings - Fork 2
Overview
GLUE (Genes Linked by Underlying Evolution) is a flexible software system for genomics. It provides comprehensive tools for storing, managing, and analyzing genetic data. By leveraging advanced DNA sequencing technologies, GLUE aims facilitate comparative genomic analysis and unlock the wealth of information embedded in molecular sequence data
The GLUE framework supports the creation of bespoke, sequence data-oriented 'projects'. These projects not only incorporate essential data items for comparative genomic analysis (e.g. sequences, multiple sequence alignments, genome feature annotations, and other associated data) they also map the semantic relationships between these items.
Projects are loaded into the GLUE "engine," forming a relational database that represents the semantic relationships between data items. This foundation supports systematic comparative analyses and the development of sequence-based resources.
It 'poises' sequences and associated data for application in computational analysis, minimising the requirement for labour-intensive pre-processing of datasets. Furthermore, it allows sequence-based analyses to be implemented in an efficient, standardised and reproducible way.
GLUE projects are equally suited for carrying out exploratory work (e.g., using virus genome data to investigate structural and functional properties of viruses) as they are for implementing operational procedures (e.g., producing standardised reports in a public or animal health setting).
Hosting of GLUE projects in an online version control system (e.g., GitHub) provides a mechanism for their stable, collaborative development.
-
Research Projects:
GLUE provides a powerful framework for managing and analyzing genomic data, enabling researchers to focus on specific biological questions and conduct comparative genomics studies across various viral genomes or other sequence data. -
Websites:
GLUE projects can be developed into interactive, user-friendly web services through the addition of a graphical user interface (GUI). This allows researchers and users to access data, visualizations, and analysis tools in a streamlined manner, enhancing user engagement and accessibility. -
Local Sequence Processing Pipelines:
GLUE can be utilized to create custom local sequence processing pipelines, enabling researchers to automate the analysis of large datasets, perform quality control checks, and implement bioinformatics workflows tailored to specific research needs. These pipelines can include tasks like alignment, variant calling, and annotation. -
Programmatic Public Services:
By exposing its capabilities as web services, GLUE can facilitate automated data retrieval, analysis, and integration, supporting research and clinical workflows that require real-time data processing. This capability allows for seamless interaction with other software and databases, enhancing data interoperability and accessibility. -
Microservices Architecture:
GLUE's deployment within a web server allows it to be integrated into an organization's wider computing infrastructure as part of a microservices software architecture. This modular approach enhances flexibility, scalability, and maintainability in computational environments, especially those handling large-scale genomic data. Each service can be independently developed and deployed, allowing teams to adapt to evolving research needs. -
Educational Tools:
GLUE can be leveraged as an educational resource for teaching bioinformatics and genomics concepts. By providing a platform for students to engage with real genomic data and perform analyses, GLUE can enhance learning outcomes and provide hands-on experience in data science. -
Collaboration Platforms:
GLUE can serve as a collaborative platform where researchers can share data, analyses, and workflows. This fosters interdisciplinary collaboration, allowing teams to combine expertise and resources in tackling complex genomic research questions.
GLUE provides a standardized yet customizable platform for genomic analyses. It enhances collaboration within research teams, supports reproducibility in analyses, and enables integration with existing bioinformatics workflows. Whether you're working on comparative genomics, phylogenetic analysis, or custom data processing, GLUE offers the tools necessary to streamline your research.
GLUE by Robert J. Gifford Lab.
For questions, issues, or feedback, please open an issue on the GitHub repository.
- Project Data Model
- Schema Extensions
- Modules
- Alignments
- Variations
- Scripting Layer
- Freemarker Templates
- Example GLUE Project
- Command Line Interpreter
- Build Your Own Project
- Querying the GLUE Database
- Working With Deep Sequencing Data
- Invoking GLUE as a Unix Command
- Known Issues and Fixes
- Overview
- Hepatitis Viruses
- Arboviruses
- Respiratory Viruses
- Animal Viruses
- Spillover Viruses
- Virus Diversity
- Retroviruses
- Paleovirology
- Transposons
- Host Genes