
Output test statistics and effect size for contingency tables
Source:R/effects.R
      effect_counts_one_grouped.RdChi squared is calculated using stats::chisq.test.
If any cell contains less than 5 observations, the exact-parameter is set.
Arguments
- data
 A tibble.
- col
 The column holding factor values.
- cross
 The column holding groups to compare.
- clean
 Prepare data by data_clean.
- ...
 Placeholder to allow calling the method with unused parameters from effect_counts.
Value
A volker list with two volker tibbles. The first tibble contains npmi values for each combinations:
n Number the combination occurs.
p_x Marginal share of the first category.
p_y Marginal share of the second category.
p_xy Share of the combination based on all combinations.
ratio The ratio of p_xy to (p_x * p_y).
pmi Pointwise Mutual information, derived from the ratio.
npmi Normalized Pointwise Mutual Information, derived from the pmi.
The second tibble contains effect sizes based on the cross table:
Cramer's V: Effect size measuring the association between the two variables.
n: Number of cases the calculation is based on.
Chi-squared: Chi-Squared test statistic. If expected values are below 5 in at least one cell, an exact Fisher test is conducted.
df: Degrees of freedo of the chi-squared test. Empty for the exact Fisher test.
p: p-value of the chi-squared test.
stars: Significance stars based on p-value (*, **, ***).
Details
Phi is derived from the Chi squared value by sqrt(fit$statistic / n).
Cramer's V is derived by sqrt(phi / (min(dim(contingency)[1], dim(contingency)[2]) - 1)).
Cramer's V is set to 1.0 for diagonal contingency matrices, indicating perfect association.
Examples
library(volker)
data <- volker::chatgpt
effect_counts_one_grouped(data, adopter, sd_gender)
#> 
#> 
#> |adopter                                  | sd_gender|  n|  p_x|  p_y| p_xy| ratio|   pmi|  npmi| fisher_p| fisher_stars|
#> |:----------------------------------------|---------:|--:|----:|----:|----:|-----:|-----:|-----:|--------:|------------:|
#> |I try new offers immediately             |    female|  2| 0.15| 0.40| 0.02|  0.34| -1.57| -0.28|    0.142|             |
#> |I try new offers immediately             |      male| 12| 0.15| 0.59| 0.12|  1.35|  0.43|  0.14|    0.187|             |
#> |I try new offers immediately             |   diverse|  1| 0.15| 0.01| 0.01|  6.73|  2.75|  0.41|    0.238|             |
#> |I try new offers rather quickly          |    female| 25| 0.62| 0.40| 0.25|  1.00|  0.00|  0.00|    1.000|             |
#> |I try new offers rather quickly          |      male| 38| 0.62| 0.59| 0.38|  1.02|  0.02|  0.02|    1.000|             |
#> |I wait until offers establish themselves |    female| 13| 0.22| 0.40| 0.13|  1.49|  0.58|  0.20|    0.142|             |
#> |I wait until offers establish themselves |      male|  9| 0.22| 0.59| 0.09|  0.69| -0.54| -0.15|    0.142|             |
#> |I only use new offers when I have no ... |      male|  1| 0.01| 0.59| 0.01|  1.68|  0.75|  0.11|    1.000|             |
#> 
#> Adjusted significance p values with fdr method.
#> 
#> 
#> 
#> |Statistic   | Value|
#> |:-----------|-----:|
#> |Cramer's V  |  0.26|
#> |Chi-squared | 13.48|
#> |n           |   101|
#> |df          |      |
#> |p           | 0.022|
#> |stars       |     *|