GitHub - ekernf01/rlookc: Leave-one-out knockoffs for efficient structure learning in R

An application-oriented R package for constructing and validating knockoffs

The framework of Model-X knockoffs and the corresponding R package knockoff together provide FDR control for subset selection in sparse regression models with multivariate Gaussian covariates (1). Model-X knockoffs have also been applied for structure learning via graphical lasso (2) by leaving out each variable in turn, and they have been mathematically extended to enable testing of composite hypotheses (3,4,5) and mixture models for covariates (6).

To facilitate progress in applications, especially in causal modeling of transcription, this package provides efficient free software for structure learning via (l)eave-(o)ne-(o)ut (k)nockoff (c)onstruction in R (hence rlookc). We offer both simple and composite null hypotheses, plus an implementation of Gaussian mixture model knockoffs. We also include certain features for checking model assumptions and FDR calibration, notably the K-nearest neighbors exchangeability test from (7).

Getting started

To install, run this. (You need the devtools package.)

devtools::install_github("ekernf01/rlookc")

The most informative vignettes for new users are:

vignette_basic (basic usage)
vignette_calibration (how to check your models)
vignette_knockoff_construction (how to generate different types of knockoffs)

To see how we use this package in our applications of interest, check out the central project repo.

Derivations

This package implements certain order-of-magnitude speedups through linear algebra tricks. Those are derived in the appendices of our paper applying this work to transcriptional regulation.

References

Candes, E., Fan, Y., Janson, L., & Lv, J. (2016). Panning for gold: Model-X knockoffs for high-dimensional controlled variable selection. arXiv preprint arXiv:1610.02351.
Zheng, Z., Zhou, J., Guo, X., & Li, D. (2018). Recovering the graphical structures via knockoffs. Procedia Computer Science, 129, 201-207.
Dai, R., & Barber, R. (2016, June). The knockoff filter for FDR control in group-sparse and multitask regression. In International Conference on Machine Learning (pp. 1851-1859). PMLR.
Multi-resolution localization of causal variants across the genome. M. Sesia, E. Katsevich, S. Bates, E. Candès, C. Sabatti Nature Communications, 11, 1093 (2020). https://www.nature.com/articles/s41467-020-14791-2
Katsevich, E., & Sabatti, C. (2019). Multilayer knockoff filter: Controlled variable selection at multiple resolutions. The annals of applied statistics, 13(1), 1.
Gimenez, J. R., Ghorbani, A., & Zou, J. (2019, April). Knockoffs for the mass: new feature importance statistics with false discovery guarantees. In The 22nd International Conference on Artificial Intelligence and Statistics (pp. 2125-2133). PMLR.
Romano, Y., Sesia, M., & Candès, E. (2020). Deep knockoffs. Journal of the American Statistical Association, 115(532), 1861-1872.

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
R		R
man		man
renv		renv
tests		tests
vignettes		vignettes
.Rbuildignore		.Rbuildignore
.Rprofile		.Rprofile
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
NAMESPACE		NAMESPACE
README.md		README.md
renv.lock		renv.lock
rlookc.Rproj		rlookc.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

An application-oriented R package for constructing and validating knockoffs

Getting started

Derivations

References

About

Releases 1

Packages

Languages

License

ekernf01/rlookc

Folders and files

Latest commit

History

Repository files navigation

An application-oriented R package for constructing and validating knockoffs

Getting started

Derivations

References

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages