Test robustness of one way anova when independence assumption is violated





.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty{ height:90px;width:728px;box-sizing:border-box;
}







0















As the title, I want to test the robustness of one way anova when the independence assumption is violated. The scenario is there is correlation between different subgroups. I used mvrnorm to generate correlated datasets, and do the simulation for 1000 times to calculate power of anova for each correlation from 0.1 to 0.9.



However it seems for each iteration the p value of anova stays constant. I hope to know the whether it is because of the way I used to generate correlated subsets is inappropriate or because of something else?? I find in fact the p value does change, but very very slightly... I expect to see when correlation increases, the power will decrease??, but in my case the power is either 1 or 0... (I have put sd for each group to be 1 for simplicity).



myfun3 <- function(index, cor, samplesize, mean1, mean2, mean3){
library(MASS)
data = mvrnorm(n=samplesize, mu=c(mean1, mean2, mean3),
Sigma=matrix(c(1, rep(cor, 3), 1, rep(cor, 3), 1), 3, 3), empirical=TRUE)
newdat<- data.frame(data)
library(tidyr)
newdat_long <- gather(newdat, group, y, X1:X3, factor_key = TRUE)
aov_result <- summary(aov(y ~ group, newdat_long))[[1]][["Pr(>F)"]][[1]]
ks_result <- kruskal.test(y ~ group, newdat_long)$p.value
return(list(aov_result, ks_result))
}

cal_power_cor <- function(iteration = 1000, samplesize=30, cor, mean1 = 10, mean2=10.5, mean3=10.8){
result_list <- lapply(1:iteration, myfun3, samplesize = samplesize, cor = cor, mean1 = mean1, mean2 = mean2,
mean3 = mean3)
result <-data.frame(matrix(unlist(result_list), ncol = 2, byrow = T))
colnames(result)<- c("anova", "kruskal")
power_aov <- sum(result$anova<0.05)/1000
power_kruskal <- sum(result$kruskal<0.05)/1000
return(list("power_aov" = power_aov, "power_kruskal" = power_kruskal))
}
cal_power_cor(cor = 0.2)
cal_power_cor(cor = 0.5)
cal_power_cor(cor = 0.9)









share|improve this question































    0















    As the title, I want to test the robustness of one way anova when the independence assumption is violated. The scenario is there is correlation between different subgroups. I used mvrnorm to generate correlated datasets, and do the simulation for 1000 times to calculate power of anova for each correlation from 0.1 to 0.9.



    However it seems for each iteration the p value of anova stays constant. I hope to know the whether it is because of the way I used to generate correlated subsets is inappropriate or because of something else?? I find in fact the p value does change, but very very slightly... I expect to see when correlation increases, the power will decrease??, but in my case the power is either 1 or 0... (I have put sd for each group to be 1 for simplicity).



    myfun3 <- function(index, cor, samplesize, mean1, mean2, mean3){
    library(MASS)
    data = mvrnorm(n=samplesize, mu=c(mean1, mean2, mean3),
    Sigma=matrix(c(1, rep(cor, 3), 1, rep(cor, 3), 1), 3, 3), empirical=TRUE)
    newdat<- data.frame(data)
    library(tidyr)
    newdat_long <- gather(newdat, group, y, X1:X3, factor_key = TRUE)
    aov_result <- summary(aov(y ~ group, newdat_long))[[1]][["Pr(>F)"]][[1]]
    ks_result <- kruskal.test(y ~ group, newdat_long)$p.value
    return(list(aov_result, ks_result))
    }

    cal_power_cor <- function(iteration = 1000, samplesize=30, cor, mean1 = 10, mean2=10.5, mean3=10.8){
    result_list <- lapply(1:iteration, myfun3, samplesize = samplesize, cor = cor, mean1 = mean1, mean2 = mean2,
    mean3 = mean3)
    result <-data.frame(matrix(unlist(result_list), ncol = 2, byrow = T))
    colnames(result)<- c("anova", "kruskal")
    power_aov <- sum(result$anova<0.05)/1000
    power_kruskal <- sum(result$kruskal<0.05)/1000
    return(list("power_aov" = power_aov, "power_kruskal" = power_kruskal))
    }
    cal_power_cor(cor = 0.2)
    cal_power_cor(cor = 0.5)
    cal_power_cor(cor = 0.9)









    share|improve this question



























      0












      0








      0








      As the title, I want to test the robustness of one way anova when the independence assumption is violated. The scenario is there is correlation between different subgroups. I used mvrnorm to generate correlated datasets, and do the simulation for 1000 times to calculate power of anova for each correlation from 0.1 to 0.9.



      However it seems for each iteration the p value of anova stays constant. I hope to know the whether it is because of the way I used to generate correlated subsets is inappropriate or because of something else?? I find in fact the p value does change, but very very slightly... I expect to see when correlation increases, the power will decrease??, but in my case the power is either 1 or 0... (I have put sd for each group to be 1 for simplicity).



      myfun3 <- function(index, cor, samplesize, mean1, mean2, mean3){
      library(MASS)
      data = mvrnorm(n=samplesize, mu=c(mean1, mean2, mean3),
      Sigma=matrix(c(1, rep(cor, 3), 1, rep(cor, 3), 1), 3, 3), empirical=TRUE)
      newdat<- data.frame(data)
      library(tidyr)
      newdat_long <- gather(newdat, group, y, X1:X3, factor_key = TRUE)
      aov_result <- summary(aov(y ~ group, newdat_long))[[1]][["Pr(>F)"]][[1]]
      ks_result <- kruskal.test(y ~ group, newdat_long)$p.value
      return(list(aov_result, ks_result))
      }

      cal_power_cor <- function(iteration = 1000, samplesize=30, cor, mean1 = 10, mean2=10.5, mean3=10.8){
      result_list <- lapply(1:iteration, myfun3, samplesize = samplesize, cor = cor, mean1 = mean1, mean2 = mean2,
      mean3 = mean3)
      result <-data.frame(matrix(unlist(result_list), ncol = 2, byrow = T))
      colnames(result)<- c("anova", "kruskal")
      power_aov <- sum(result$anova<0.05)/1000
      power_kruskal <- sum(result$kruskal<0.05)/1000
      return(list("power_aov" = power_aov, "power_kruskal" = power_kruskal))
      }
      cal_power_cor(cor = 0.2)
      cal_power_cor(cor = 0.5)
      cal_power_cor(cor = 0.9)









      share|improve this question
















      As the title, I want to test the robustness of one way anova when the independence assumption is violated. The scenario is there is correlation between different subgroups. I used mvrnorm to generate correlated datasets, and do the simulation for 1000 times to calculate power of anova for each correlation from 0.1 to 0.9.



      However it seems for each iteration the p value of anova stays constant. I hope to know the whether it is because of the way I used to generate correlated subsets is inappropriate or because of something else?? I find in fact the p value does change, but very very slightly... I expect to see when correlation increases, the power will decrease??, but in my case the power is either 1 or 0... (I have put sd for each group to be 1 for simplicity).



      myfun3 <- function(index, cor, samplesize, mean1, mean2, mean3){
      library(MASS)
      data = mvrnorm(n=samplesize, mu=c(mean1, mean2, mean3),
      Sigma=matrix(c(1, rep(cor, 3), 1, rep(cor, 3), 1), 3, 3), empirical=TRUE)
      newdat<- data.frame(data)
      library(tidyr)
      newdat_long <- gather(newdat, group, y, X1:X3, factor_key = TRUE)
      aov_result <- summary(aov(y ~ group, newdat_long))[[1]][["Pr(>F)"]][[1]]
      ks_result <- kruskal.test(y ~ group, newdat_long)$p.value
      return(list(aov_result, ks_result))
      }

      cal_power_cor <- function(iteration = 1000, samplesize=30, cor, mean1 = 10, mean2=10.5, mean3=10.8){
      result_list <- lapply(1:iteration, myfun3, samplesize = samplesize, cor = cor, mean1 = mean1, mean2 = mean2,
      mean3 = mean3)
      result <-data.frame(matrix(unlist(result_list), ncol = 2, byrow = T))
      colnames(result)<- c("anova", "kruskal")
      power_aov <- sum(result$anova<0.05)/1000
      power_kruskal <- sum(result$kruskal<0.05)/1000
      return(list("power_aov" = power_aov, "power_kruskal" = power_kruskal))
      }
      cal_power_cor(cor = 0.2)
      cal_power_cor(cor = 0.5)
      cal_power_cor(cor = 0.9)






      r anova robustness






      share|improve this question















      share|improve this question













      share|improve this question




      share|improve this question








      edited Nov 16 '18 at 20:54







      Teng Li

















      asked Nov 16 '18 at 20:49









      Teng LiTeng Li

      84




      84
























          0






          active

          oldest

          votes












          Your Answer






          StackExchange.ifUsing("editor", function () {
          StackExchange.using("externalEditor", function () {
          StackExchange.using("snippets", function () {
          StackExchange.snippets.init();
          });
          });
          }, "code-snippets");

          StackExchange.ready(function() {
          var channelOptions = {
          tags: "".split(" "),
          id: "1"
          };
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function() {
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled) {
          StackExchange.using("snippets", function() {
          createEditor();
          });
          }
          else {
          createEditor();
          }
          });

          function createEditor() {
          StackExchange.prepareEditor({
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader: {
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          },
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          });


          }
          });














          draft saved

          draft discarded


















          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53345181%2ftest-robustness-of-one-way-anova-when-independence-assumption-is-violated%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown

























          0






          active

          oldest

          votes








          0






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes
















          draft saved

          draft discarded




















































          Thanks for contributing an answer to Stack Overflow!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid



          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53345181%2ftest-robustness-of-one-way-anova-when-independence-assumption-is-violated%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Xamarin.iOS Cant Deploy on Iphone

          Glorious Revolution

          Dulmage-Mendelsohn matrix decomposition in Python