Type: Package
Title: Assess the Diagnostic Power of Genomic Marker Combinations
Version: 2.1.1
Description: Population genetics package for designing diagnostic panels. Candidate markers, marker combinations, and different panel sizes are assessed for how well they can predict the source population of known samples. Requires a genotype file of candidate markers in STRUCTURE format. Methods for population cross-validation are described in Jombart (2008) <doi:10.1093/bioinformatics/btn129>.
License: MIT + file LICENSE
Contact: Kim Vertacnik <kim.vertacnik@mailbox.org> or Julian Dupuis <julian.dupuis@uky.edu>
URL: https://github.com/OksanaVe/snpAIMeR
Depends: R (≥ 2.10)
Imports: adegenet, doParallel, dplyr, forcats, foreach, ggplot2, graphics, magrittr, parallel, readr, tidyr, utils, withr, yaml
Suggests: testthat (≥ 3.0.0)
Config/testthat/edition: 3
Encoding: UTF-8
RoxygenNote: 7.3.1
NeedsCompilation: no
Packaged: 2024-02-20 20:01:29 UTC; kim
Author: Kim Vertacnik ORCID iD [cre, aut], Oksana Vernygora ORCID iD [aut], Julian Dupuis ORCID iD [aut]
Maintainer: Kim Vertacnik <kim.vertacnik@mailbox.org>
Repository: CRAN
Date/Publication: 2024-02-20 20:20:02 UTC

Assess the Diagnostic Power of Genomic Marker Combinations

Description

Population genetics package for optimizing diagnostic panels. User-selected candidate markers are assessed individually and in combination for how well they can predict the source population of known samples. Requires a genotype file in STRUCTURE format.

Usage

snpAIMeR(run_mode, config_file = NULL, verbose = TRUE)

Arguments

run_mode

Modes are "interactive", "non-interactive", or "example"; mode must be in quotes.

config_file

Yaml file required for "non-interactive" mode; filename/path must be in quotes.

verbose

Default is TRUE.

Details

Yaml file format for "non-interactive" mode (do not include bullet points):

Minimizing run time: Because of the number of possible combinations, we recommend testing no more than 15 markers. For example, testing 15 markers in panel sizes of 1 to 15 (32,767 total combinations) with 1,000 cross-validation replicates on a system with 48 processor cores took about 5 hours and 20 GB RAM. Reducing the number of cross-validation replicates will reduce run time, however, we recommend no less than 100 replicates.

Value

Cross-validation assignment rates for individual markers, marker combinations, and panel sizes. Outputs three .csv and two .pdf files to a user-specified directory.

See Also

https://github.com/OksanaVe/snpAIMeR

Examples

if (requireNamespace("adegenet", quietly = TRUE)) {
  data(nancycats, package = "adegenet")
  snpAIMeR("example", verbose = TRUE)
}