Abundance of perfect and imperfect trinucleotide repeats in chimpanzee chromosome 20

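ALL

| Repeat | Perfect TL | Perfect TN | Perfect LPMb | Perfect NPMb | Imperfect TL | Imperfect TN | Imperfect LPMb | Imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 10833 | 683 | 166.09 | 10.47 | 15961 | 672 | 244.72 | 10.30 |
| aag | 3873 | 249 | 59.38 | 3.82 | 8683 | 195 | 133.13 | 2.99 |
| aat | 16533 | 1067 | 253.49 | 16.36 | 24167 | 1044 | 370.54 | 16.01 |
| acc | 3180 | 238 | 48.76 | 3.65 | 5661 | 226 | 86.80 | 3.46 |
| acg | 36 | 3 | 0.55 | 0.05 | 36 | 3 | 0.55 | 0.05 |
| act | 510 | 35 | 7.82 | 0.54 | 969 | 33 | 14.86 | 0.51 |
| agc | 3519 | 259 | 53.95 | 3.97 | 4700 | 252 | 72.06 | 3.86 |
| agg | 4974 | 355 | 76.26 | 5.44 | 9833 | 319 | 150.76 | 4.89 |
| atc | 3012 | 201 | 46.18 | 3.08 | 6511 | 189 | 99.83 | 2.90 |
| ccg | 1221 | 80 | 18.72 | 1.23 | 3245 | 70 | 49.75 | 1.07 |

CODING

| Repeat | Perfect TL | Perfect TN | Perfect LPMb | Perfect NPMb | Imperfect TL | Imperfect TN | Imperfect LPMb | Imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| aag | 69 | 5 | 81.87 | 5.93 | 117 | 5 | 138.82 | 5.93 |
| aat | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| acc | 36 | 3 | 42.71 | 3.56 | 93 | 3 | 110.34 | 3.56 |
| acg | 12 | 1 | 14.24 | 1.19 | 12 | 1 | 14.24 | 1.19 |
| act | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| agc | 462 | 33 | 548.16 | 39.15 | 834 | 31 | 989.53 | 36.78 |
| agg | 228 | 18 | 270.52 | 21.36 | 476 | 18 | 564.77 | 21.36 |
| atc | 12 | 1 | 14.24 | 1.19 | 12 | 1 | 14.24 | 1.19 |
| ccg | 228 | 16 | 270.52 | 18.98 | 557 | 14 | 660.87 | 16.61 |

INTRON

| Repeat | Perfect TL | Perfect TN | Perfect LPMb | Perfect NPMb | Imperfect TL | Imperfect TN | Imperfect LPMb | Imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 2949 | 186 | 150.85 | 9.52 | 4410 | 184 | 225.59 | 9.41 |
| aag | 1089 | 67 | 55.71 | 3.43 | 2847 | 50 | 145.63 | 2.56 |
| aat | 4770 | 302 | 244.00 | 15.45 | 6868 | 298 | 351.32 | 15.24 |
| acc | 1029 | 76 | 52.64 | 3.89 | 1777 | 74 | 90.90 | 3.79 |
| acg | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| act | 168 | 12 | 8.59 | 0.61 | 254 | 11 | 12.99 | 0.56 |
| agc | 1011 | 76 | 51.72 | 3.89 | 1240 | 74 | 63.43 | 3.79 |
| agg | 1452 | 110 | 74.28 | 5.63 | 2863 | 100 | 146.45 | 5.12 |
| atc | 891 | 56 | 45.58 | 2.87 | 1541 | 52 | 78.83 | 2.66 |
| ccg | 210 | 13 | 10.74 | 0.67 | 510 | 12 | 26.09 | 0.61 |

INTERGENIC

| Repeat | Perfect TL | Perfect TN | Perfect LPMb | Perfect NPMb | Imperfect TL | Imperfect TN | Imperfect LPMb | Imperfect NPMb |
|---|---|---|---|---|---|---|---|---|
| aac | 7239 | 455 | 161.48 | 10.15 | 10579 | 446 | 235.98 | 9.95 |
| aag | 2493 | 163 | 55.61 | 3.64 | 5393 | 126 | 120.30 | 2.81 |
| aat | 10761 | 696 | 240.04 | 15.53 | 15906 | 680 | 354.81 | 15.17 |
| acc | 1893 | 143 | 42.23 | 3.19 | 3509 | 133 | 78.27 | 2.97 |
| acg | 24 | 2 | 0.54 | 0.04 | 24 | 2 | 0.54 | 0.04 |
| act | 330 | 22 | 7.36 | 0.49 | 703 | 21 | 15.68 | 0.47 |
| agc | 1725 | 127 | 38.48 | 2.83 | 2127 | 126 | 47.45 | 2.81 |
| agg | 3006 | 204 | 67.05 | 4.55 | 6068 | 178 | 135.35 | 3.97 |
| atc | 1920 | 130 | 42.83 | 2.90 | 4699 | 122 | 104.82 | 2.72 |
| ccg | 636 | 43 | 14.19 | 0.96 | 1880 | 37 | 41.94 | 0.82 |
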
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
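
The two per-megabase columns are densities: a total length or a count divided by the length of the region in megabases. The following is a minimal Python sketch of that scaling. The function name `per_megabase` is illustrative, and the region length of 65.22 Mb is not stated in the table; it is back-calculated from the perfect aac values for all sequences (10833 / 166.09), so treat it as an assumption.

```python
def per_megabase(value, region_length_bp):
    """Scale a total (length or count) to a density per megabase."""
    return value / (region_length_bp / 1_000_000)

# Assumed region length, inferred from TL / LPMb of the aac row; not given in the source.
ASSUMED_ALL_LENGTH_BP = 65_220_000

lpmb = per_megabase(10833, ASSUMED_ALL_LENGTH_BP)  # perfect aac, total length (TL)
npmb = per_megabase(683, ASSUMED_ALL_LENGTH_BP)    # perfect aac, total number (TN)
print(round(lpmb, 2), round(npmb, 2))
```

Because both columns use the same divisor, LPMb / NPMb equals TL / TN, which gives a quick consistency check on any row.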

Statistical analysis (contingency test results)

Data points with the smallest value were excluded iteratively until each remaining data point contributed more than 5% of the total. Data sets were normalized to 100%.

| Region | Feature | Probability | Chi-square | Degrees of freedom | Used / all data points | Excluded repeats |
|---|---|---|---|---|---|---|
| All sequences | length per megabase | 0.960 | 1.494 | 6 | 7 / 10 | acg, act, ccg |
| All sequences | number per megabase | 1.000 | 0.188 | 6 | 7 / 10 | acg, act, ccg |
| Coding regions | length per megabase | 0.838 | 0.850 | 3 | 4 / 10 | aac, aat, acc, acg, act, atc |
| Coding regions | number per megabase | 0.990 | 0.117 | 3 | 4 / 10 | aac, aat, acc, acg, act, atc |
| Introns | length per megabase | 0.919 | 2.008 | 6 | 7 / 10 | acg, act, ccg |
| Introns | number per megabase | 1.000 | 0.259 | 6 | 7 / 10 | acg, act, ccg |
| Intergenic regions | length per megabase | 0.889 | 1.702 | 5 | 6 / 10 | acg, act, agc, ccg |
| Intergenic regions | number per megabase | 1.000 | 0.233 | 6 | 7 / 10 | acg, act, ccg |
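
The exclusion, normalization and contingency test described above can be sketched in Python as follows. This is one possible reading of the procedure, not necessarily the authors' implementation: it assumes a repeat class is dropped when its share falls to 5% or less in either the perfect or the imperfect data set, and that the test is a standard 2 x k chi-square contingency test on the two normalized rows. All function names are illustrative. The input values are the LPMb columns for all sequences from the abundance table.

```python
import math

# LPMb values for all sequences, taken from the abundance table.
PERFECT = {"aac": 166.09, "aag": 59.38, "aat": 253.49, "acc": 48.76, "acg": 0.55,
           "act": 7.82, "agc": 53.95, "agg": 76.26, "atc": 46.18, "ccg": 18.72}
IMPERFECT = {"aac": 244.72, "aag": 133.13, "aat": 370.54, "acc": 86.80, "acg": 0.55,
             "act": 14.86, "agc": 72.06, "agg": 150.76, "atc": 99.83, "ccg": 49.75}

def percentages(data, keys):
    """Normalize the selected values so that they sum to 100%."""
    total = sum(data[k] for k in keys)
    return [100.0 * data[k] / total for k in keys]

def exclude_small(a, b, threshold=5.0):
    """Drop the smallest class repeatedly until every share exceeds the threshold.

    Assumption: a class's share is the smaller of its shares in the two sets.
    """
    keys = sorted(a)
    while len(keys) > 1:
        shares = [min(p, q) for p, q in zip(percentages(a, keys), percentages(b, keys))]
        smallest = min(range(len(keys)), key=shares.__getitem__)
        if shares[smallest] > threshold:
            break
        del keys[smallest]
    return keys

def contingency_chi2(row_a, row_b):
    """Chi-square statistic and degrees of freedom for a 2 x k contingency table."""
    total_a, total_b = sum(row_a), sum(row_b)
    grand = total_a + total_b
    chi2 = 0.0
    for a, b in zip(row_a, row_b):
        for observed, row_total in ((a, total_a), (b, total_b)):
            expected = row_total * (a + b) / grand
            chi2 += (observed - expected) ** 2 / expected
    return chi2, len(row_a) - 1

def chi2_sf(x, df):
    """Upper-tail probability of the chi-square distribution (closed-form series)."""
    half = x / 2.0
    if df % 2 == 0:
        term = total = 1.0
        for i in range(1, df // 2):
            term *= half / i
            total += term
        return math.exp(-half) * total
    term, total = 1.0, 0.0
    for j in range((df - 1) // 2):
        if j > 0:
            term *= x / (2 * j + 1)
        total += term
    return math.erfc(math.sqrt(half)) + math.sqrt(2 * x / math.pi) * math.exp(-half) * total

kept = exclude_small(PERFECT, IMPERFECT)
chi2, df = contingency_chi2(percentages(PERFECT, kept), percentages(IMPERFECT, kept))
p = chi2_sf(chi2, df)
print(kept, round(chi2, 3), df, round(p, 3))
```

Under these assumptions the sketch drops acg, act and ccg and agrees, to within rounding, with the "All sequences, length per megabase" row of the results table (chi-square 1.494, 6 degrees of freedom, probability 0.960). A high probability here means the test finds no significant difference between the perfect and imperfect distributions.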