03-26-2024, 07:26 AM
I think I notices some bad bug in Admixtools 2 for group statistics.
( from library("admixtools"))
Running the same test for the same groups: I got different results for each run.
It doesn't make much sense , unless if there is some random algorithm for simplification of calculations.
Test population:
TEST
[1] "Turkey_N" "Iran_GanjDareh_N" "Georgia_Kotias.SG" "Russia_Sidelkino_HG.SG"
[5] "Tajikistan_BA_DashtiKozy" "Greece_Minoan_Lassithi" "Greece_Minoan_Kephala_Petras.SG" "Greece_Minoan_Odigitria"
[9] "Italy_South_HG_Ostuni2" "Russia_Ust_Ishim.DG" "Denisova.DG" "Vindija_Neanderthal.DG"
[13] "Papuan.SDG" "Greece_BA_Mycenaean_Pylos" "Greece_BA_Mycenaean" "Micronesia_Pohnpei_400BP"
[17] "Chimp.REF" "JPT.SG" "Mongolia_Salkhit_UP.SG" "Japan_HG_Jomon"
>
This is the test:
f4( prefix, TEST , "Russia_Ust_Ishim.DG", "Denisova.DG" , "Vindija_Neanderthal.DG" ) #
prefix -> is the path to the dataset v52.2_1240K : in my case in another subfolder:
"../v52.2/geno/v52.2_1240K_public"
To run the test just execute the command in R: f4( prefix, TEST , "Russia_Ust_Ishim.DG", "Denisova.DG" , "Vindija_Neanderthal.DG" )
This is the activity log in real time:
i Getting population combinations...
i 20 population combinations found
i Computing from f4 from genotype data...
i Reading metadata...
i Computing block lengths for 1150639 SNPs...
i Computing 20 f4-statistics for block 713 out of 713...
i Summarize across blocks...
Next: the results:
( from library("admixtools"))
Running the same test for the same groups: I got different results for each run.
It doesn't make much sense , unless if there is some random algorithm for simplification of calculations.
Test population:
TEST
[1] "Turkey_N" "Iran_GanjDareh_N" "Georgia_Kotias.SG" "Russia_Sidelkino_HG.SG"
[5] "Tajikistan_BA_DashtiKozy" "Greece_Minoan_Lassithi" "Greece_Minoan_Kephala_Petras.SG" "Greece_Minoan_Odigitria"
[9] "Italy_South_HG_Ostuni2" "Russia_Ust_Ishim.DG" "Denisova.DG" "Vindija_Neanderthal.DG"
[13] "Papuan.SDG" "Greece_BA_Mycenaean_Pylos" "Greece_BA_Mycenaean" "Micronesia_Pohnpei_400BP"
[17] "Chimp.REF" "JPT.SG" "Mongolia_Salkhit_UP.SG" "Japan_HG_Jomon"
>
This is the test:
f4( prefix, TEST , "Russia_Ust_Ishim.DG", "Denisova.DG" , "Vindija_Neanderthal.DG" ) #
prefix -> is the path to the dataset v52.2_1240K : in my case in another subfolder:
"../v52.2/geno/v52.2_1240K_public"
To run the test just execute the command in R: f4( prefix, TEST , "Russia_Ust_Ishim.DG", "Denisova.DG" , "Vindija_Neanderthal.DG" )
This is the activity log in real time:
i Getting population combinations...
i 20 population combinations found
i Computing from f4 from genotype data...
i Reading metadata...
i Computing block lengths for 1150639 SNPs...
i Computing 20 f4-statistics for block 713 out of 713...
i Summarize across blocks...
Next: the results: