Skip to content

Commit

Permalink
bold.analyze.align documentation update
Browse files Browse the repository at this point in the history
  • Loading branch information
sameerpadhye committed Sep 13, 2024
1 parent d15622d commit 3c5b89d
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
2 changes: 1 addition & 1 deletion R/bold.analyze.align.R
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,7 @@
#' @param ... additional arguments that can be passed to `msa::msa()` function.
#'
#' @details
#' `bold.analyze.align` retrieves the sequence information obtained using `bold.fetch` function and performs a multiple sequence alignment. Type of clustering method can be specified using the `align.method` argument. It utilizes the `msa::msa()` function with default settings but additional arguments from the `msa` function can be passed via the `...` argument. Marker name provided must match with the standard marker names (Ex. COI-5P) available on the BOLD webpage (https://boldsystems.org/; Ratnasingham et al. 2024; pg.404). Name for individual sequences in the output can be customized by using the `seq.name.fields` argument. If more than one field is specified, the name will follow the sequence of the fields given in the vector. Performing a multiple sequence alignment on large sequence data might slow the system. Additionally, users are responsible for verifying the sequence quality and integrity, as the function does not provide any checks on issues like STOP codons and indels within the data by default. The output of this function is a modified Barcode Core Data Model (BCDM) dataframe, which includes two additional columns: one for the aligned sequences and another for the names given to the sequences.
#' `bold.analyze.align` retrieves the sequence information obtained using `bold.fetch` function and performs a multiple sequence alignment. It utilizes the `msa::msa()` function with default settings but additional arguments from the `msa` function can be passed via the `...` argument. Type of clustering method can be specified using the `align.method` argument(`Muscle`,`ClustalW` and `ClustalOmega` are available using the `msa` package). Marker name provided must match with the standard marker names (Ex. COI-5P) available on the \href{https://boldsystems.org/}{BOLD webpage} (Ratnasingham et al. 2024; pg.404). Name for individual sequences in the output can be customized by using the `seq.name.fields` argument. If more than one field is specified, the name will follow the sequence of the fields given in the vector. Performing a multiple sequence alignment on large sequence data might slow the system. Additionally, users are responsible for verifying the sequence quality and integrity, as the function does not provide any checks on issues like STOP codons and indels within the data by default. The output of this function is a modified Barcode Core Data Model (BCDM) dataframe, which includes two additional columns: one for the aligned sequences and another for the names given to the sequences.
#'
#' \emph{Note: }. Users are required to install and load the `Biostrings` and `msa` packages using `BiocManager` before running this function.
#'
Expand Down
2 changes: 1 addition & 1 deletion man/bold.analyze.align.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

0 comments on commit 3c5b89d

Please sign in to comment.