Abundance of perfect and imperfect trinucleotide repeats in chimpanzee chromosome 3

ALL
       perfect                           imperfect
motif       TL       TN     LPMb     NPMb       TL       TN     LPMb     NPMb
aac      38232     2460   181.16    11.66    57296     2400   271.50    11.37
aag      11055      715    52.38     3.39    21760      621   103.11     2.94
aat      55047     3633   260.84    17.21    82426     3534   390.58    16.75
acc       8289      619    39.28     2.93    15298      582    72.49     2.76
acg         27        2     0.13     0.01       27        2     0.13     0.01
act       2145      157    10.16     0.74     3452      150    16.36     0.71
agc       7260      533    34.40     2.53     9867      529    46.76     2.51
agg      10554      797    50.01     3.78    21634      740   102.51     3.51
atc       7941      535    37.63     2.54    12787      508    60.59     2.41
ccg       2277      158    10.79     0.75     5777      147    27.38     0.70

CODING
       perfect                           imperfect
motif       TL       TN     LPMb     NPMb       TL       TN     LPMb     NPMb
aac         24        2    13.05     1.09       33        2    17.95     1.09
aag        225       18   122.35     9.79      322       18   175.10     9.79
aat         24        2    13.05     1.09       30        2    16.31     1.09
acc        213       17   115.83     9.24      298       17   162.05     9.24
acg         15        1     8.16     0.54       15        1     8.16     0.54
act         12        1     6.53     0.54       12        1     6.53     0.54
agc        561       41   305.07    22.30     1034       41   562.28    22.30
agg        465       34   252.86    18.49      753       32   409.47    17.40
atc        102        8    55.47     4.35      165        8    89.72     4.35
ccg        522       39   283.86    21.21     1057       37   574.79    20.12

INTRON
       perfect                           imperfect
motif       TL       TN     LPMb     NPMb       TL       TN     LPMb     NPMb
aac      12252      788   187.44    12.06    18451      774   282.28    11.84
aag       2478      173    37.91     2.65     4853      161    74.25     2.46
aat      17334     1128   265.19    17.26    25438     1097   389.18    16.78
acc       3093      228    47.32     3.49     6233      200    95.36     3.06
acg         12        1     0.18     0.01       12        1     0.18     0.01
act        753       54    11.52     0.83     1258       53    19.25     0.81
agc       2046      153    31.30     2.34     2667      152    40.80     2.33
agg       3189      242    48.79     3.70     5656      231    86.53     3.53
atc       2445      167    37.41     2.56     3653      161    55.89     2.46
ccg        408       29     6.24     0.44     1235       26    18.89     0.40

INTERGENIC
       perfect                           imperfect
motif       TL       TN     LPMb     NPMb       TL       TN     LPMb     NPMb
aac      23475     1511   163.21    10.51    35184     1471   244.62    10.23
aag       7719      479    53.67     3.33    15346      404   106.69     2.81
aat      35232     2339   244.95    16.26    53557     2275   372.35    15.82
acc       4557      341    31.68     2.37     7841      332    54.51     2.31
acg          0        0     0.00     0.00        0        0     0.00     0.00
act       1275       94     8.86     0.65     2020       88    14.04     0.61
agc       4035      298    28.05     2.07     5400      295    37.54     2.05
agg       6186      470    43.01     3.27    13799      430    95.94     2.99
atc       4947      327    34.39     2.27     8121      307    56.46     2.13
ccg        990       65     6.88     0.45     2692       59    18.72     0.41

TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
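
The per-megabase columns are densities: a total divided by the size of the region in megabases. The region sizes are not stated in the table, so the short Python sketch below back-calculates one from TL and LPMb (an assumption made for illustration only) and uses it to recover NPMb for one row. The function name per_megabase is hypothetical.

```python
def per_megabase(value, region_mb):
    """Density of a total (length or count) over a region of region_mb megabases."""
    return value / region_mb


# aac, perfect repeats, all sequences: TL, TN and LPMb as given in the table.
tl, tn, lpmb = 38232, 2460, 181.16

# Assumption: the region size is not given in the source, so it is
# inferred here from the ratio TL / LPMb.
region_mb = tl / lpmb

# NPMb recomputed from TN and the inferred region size.
npmb = per_megabase(tn, region_mb)
print(round(region_mb, 2), round(npmb, 2))
```

The recomputed NPMb can be compared with the tabulated value for the same row as a consistency check on the column split.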

Statistical analysis (contingency test results)

Data points with the smallest values were excluded iteratively until every remaining data point contributed more than 5% of the total. Data sets were normalized to 100%.

Feature                 Probability  ChiSquare  Degrees of freedom  used / all data points  Excluded repeats
All sequences
  length per megabase   0.983        0.694      5                   6 / 10                  acg, act, agc, ccg
  number per megabase   1.000        0.053      6                   7 / 10                  acg, act, ccg
Coding regions
  length per megabase   0.931        0.855      4                   5 / 10                  aac, aat, acg, act, atc
  number per megabase   1.000        0.040      5                   6 / 10                  aac, aat, acg, act
Introns
  length per megabase   0.987        0.619      5                   6 / 10                  acg, act, agc, ccg
  number per megabase   1.000        0.047      6                   7 / 10                  acg, act, ccg
Intergenic regions
  length per megabase   0.974        0.842      5                   6 / 10                  acg, act, agc, ccg
  number per megabase   1.000        0.084      6                   7 / 10                  acg, act, ccg
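
The exclusion and normalization steps described in the note can be sketched in Python as follows. This is one possible reading of the procedure, not a statement of the software actually used. It assumes that the test compares the perfect and imperfect series, that a motif is dropped when its share falls to 5% or below in either series, and that the chi-square statistic is computed on the two normalized percentage rows. The function names are hypothetical, and the input values are the LPMb figures for all sequences. Under these assumptions the sketch appears to give values close to those reported for that row (chi-square near 0.694, 5 degrees of freedom, probability near 0.983).

```python
import math

# LPMb values for all sequences, copied from the repeat table.
PERFECT = {"aac": 181.16, "aag": 52.38, "aat": 260.84, "acc": 39.28, "acg": 0.13,
           "act": 10.16, "agc": 34.40, "agg": 50.01, "atc": 37.63, "ccg": 10.79}
IMPERFECT = {"aac": 271.50, "aag": 103.11, "aat": 390.58, "acc": 72.49, "acg": 0.13,
             "act": 16.36, "agc": 46.76, "agg": 102.51, "atc": 60.59, "ccg": 27.38}


def share(series, motif, kept):
    """Fraction of the total over the kept motifs contributed by one motif."""
    total = sum(series[k] for k in kept)
    return series[motif] / total if total else 0.0


def filter_small(series_list, threshold=0.05):
    """Drop the smallest motif, one at a time, until every remaining motif
    contributes more than `threshold` in every series (assumed reading)."""
    kept = sorted(series_list[0])
    while len(kept) > 1:
        worst = min(kept, key=lambda m: min(share(s, m, kept) for s in series_list))
        if min(share(s, worst, kept) for s in series_list) > threshold:
            break
        kept.remove(worst)
    return kept


def normalize(series, kept):
    """Rescale the kept values so that they sum to 100%."""
    total = sum(series[m] for m in kept)
    return [100.0 * series[m] / total for m in kept]


def chi_square(rows):
    """Pearson chi-square statistic and degrees of freedom for a table."""
    row_totals = [sum(r) for r in rows]
    col_totals = [sum(c) for c in zip(*rows)]
    grand = sum(row_totals)
    stat = 0.0
    for row, rt in zip(rows, row_totals):
        for observed, ct in zip(row, col_totals):
            expected = rt * ct / grand
            stat += (observed - expected) ** 2 / expected
    return stat, (len(rows) - 1) * (len(col_totals) - 1)


def chi2_sf(x, df):
    """Upper-tail chi-square probability for integer df (closed form)."""
    half = x / 2.0
    if df % 2 == 0:
        term = total = 1.0
        for k in range(1, df // 2):
            term *= half / k
            total += term
        return math.exp(-half) * total
    total, term = 0.0, 1.0
    for k in range((df - 1) // 2):
        if k > 0:
            term *= x / (2 * k + 1)
        total += term
    return (math.erfc(math.sqrt(half))
            + math.sqrt(2.0 * x / math.pi) * math.exp(-half) * total)


kept = filter_small([PERFECT, IMPERFECT])
stat, df = chi_square([normalize(PERFECT, kept), normalize(IMPERFECT, kept)])
p = chi2_sf(stat, df)
print(kept, round(stat, 3), df, round(p, 3))
```

Swapping in the NPMb values, or the values for another region, exercises the same steps on the other rows. Whether those rows are reproduced equally well has not been checked here.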