Last updated: 2023-07-03

Checks: 2 0

Knit directory: komputeExamples/

This reproducible R Markdown analysis was created with workflowr (version 1.7.0.1). The Checks tab describes the reproducibility checks that were applied when the results were created. The Past versions tab lists the development history.


Great! Since the R Markdown file has been committed to the Git repository, you know the exact version of the code that produced these results.

Great! You are using Git for version control. Tracking code development and connecting the code version to the results is critical for reproducibility.

The results in this page were generated with repository version 0423156. See the Past versions tab to see a history of the changes made to the R Markdown and HTML files.

Note that you need to be careful to ensure that all relevant files for the analysis have been committed to Git prior to generating the results (you can use wflow_publish or wflow_git_commit). workflowr only checks the R Markdown file, but you know if there are other scripts or data files that it depends on. Below is the status of the Git repository when the results were generated:


Ignored files:
    Ignored:    .DS_Store
    Ignored:    .Rhistory
    Ignored:    .Rproj.user/
    Ignored:    analysis/.DS_Store
    Ignored:    code/.DS_Store

Untracked files:
    Untracked:  analysis/kompute_test_app_v16.Rmd

Unstaged changes:
    Modified:   data/BC.imp.res.v16.RData
    Modified:   data/CC.imp.res.v16.RData
    Modified:   data/OF.imp.res.v16.RData

Note that any generated files, e.g. HTML, png, CSS, etc., are not included in this status report because it is ok for generated content to have uncommitted changes.


These are the previous versions of the repository in which changes were made to the R Markdown (analysis/index.Rmd) and HTML (docs/index.html) files. If you’ve configured a remote Git repository (see ?wflow_git_remote), click on the hyperlinks in the table below to view the files as they were in that past version.

File Version Author Date Message
Rmd 0423156 statsleelab 2023-07-03 updated
Rmd a9f14db statsleelab 2023-07-03 updated
html 0100cc1 statsleelab 2023-07-03 analysis 5 added
Rmd 576d2cb statsleelab 2023-07-03 updated
Rmd e607745 statsleelab 2023-07-03 added comparison in application study
html 9b680ec statsleelab 2023-06-20 doc updated
Rmd 708f488 statsleelab 2023-06-20 typo fixed
Rmd d343f6a statsleelab 2023-06-20 doc updated
html c1dc8ec statsleelab 2023-06-19 updated
Rmd 96c6efb statsleelab 2023-06-19 text updated
Rmd a04b2cf statsleelab 2023-06-19 text
Rmd faa94f6 statsleelab 2023-06-19 text updated
html 8460ab6 statsleelab 2023-06-19 typo fixed
Rmd d61de3c statsleelab 2023-06-19 typo fixed
html d4d5818 statsleelab 2023-06-19 index updated
Rmd 2bc2cfa statsleelab 2023-06-19 index updated
Rmd 8a94adb statsleelab 2023-06-19 index updated
html d56cc7a statsleelab 2023-06-19 Build site.
html 7685a09 statsleelab 2023-01-10 first commit
Rmd 32dd775 statsleelab 2023-01-10 first commit
Rmd 0408554 statsleelab 2023-01-10 Start workflowr project.


Here, we present a comprehensive comparative study evaluating the KOMPUTE imputation algorithm with data from the International Mouse Phenotyping Consortium (IMPC). Our analysis explores the IMPC data version 16 across three distinct phenotyping domains: Body Composition, Clinical Chemistry and Open Field.

In our initial phase (Analysis 1), we assessed the performance of the KOMPUTE method by conducting extensive simulation studies. We then compare its performance to that of a Singular Value Decomposition (SVD) matrix completion method (Kurucz, et al., 2007).

To further evaluate the effectiveness of KOMPUTE in more realistic scenarios, we conducted additional experiments (Analyses 2-4) employing actual measured association Z-scores from three phenotype domains: Body Composition (8 phenotypes), Clinical Chemistry (19 phenotypes), and Open Field (14 phenotypes). For each domain, we started by estimating a phenotype correlation matrix utilizing control mice phenotypes and adjusted for non-biological factors via PVCA and ComBat. Subsequently, we randomly masked 1000 gene-phenotype association Z-scores from each domain. Using the remaining Z-scores alongside the estimated phenotype correlation matrix, we imputed the missing values employing the KOMPUTE method.

IMPC Data - Version 16