Abundance of perfect and imperfect trinucleotide repeats in chimpanzee chromosome 8

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 27213 1748176.5911.34 40472 1715262.6411.13 15 112.980.86 15 112.980.86 7509 472188.5811.85 11056 465277.6711.68 18837 1218166.5110.77 28193 1192249.2210.54
aag 8964 56558.173.67 17294 497112.233.23 72 562.284.33 114 598.624.33 2340 14858.773.72 4493 132112.843.31 6243 39155.193.46 11995 344106.033.04
aat 38469 2571249.6416.68 57450 2502372.8116.24 15 112.980.86 15 112.980.86 9861 664247.6516.68 14735 648370.0616.27 26856 1788237.4015.80 40286 1740356.1215.38
acc 5946 42838.592.78 10749 39469.752.56 99 885.646.92 216 8186.856.92 1905 13747.843.44 3363 11984.462.99 3648 26132.252.31 6723 24659.432.17
acg000.000.00000.000.00000.000.00000.000.00000.000.00000.000.00000.000.00000.000.00
act 1596 11610.360.75 2865 10818.590.70 0 00.000.00 0 00.000.00 504 3512.660.88 829 3120.820.78 1044 779.230.68 1988 7317.570.65
agc 5418 40435.162.62 7310 39747.442.58 333 24288.0720.76 504 24436.0020.76 1437 10436.092.61 1974 10349.582.59 3381 25629.892.26 4412 25039.002.21
agg 9111 70359.124.56 20195 614131.053.98 315 24272.5020.76 528 23456.7619.90 2631 20666.085.17 6411 185161.014.65 5661 43550.043.85 12273 373108.493.30
atc 7446 48448.323.14 14166 45391.932.94 165 12142.7410.38 318 10275.098.65 2034 13251.083.31 3365 13084.513.27 4866 31643.012.79 9954 29087.992.56
ccg 1767 12111.470.79 4082 11126.490.72 381 28329.5924.22 779 26673.8922.49 339 228.510.55 633 1915.900.48 915 628.090.55 2274 5720.100.50
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.9601.03456 / 9 act, agc, ccg
number per megabase1.0000.08967 / 9 act, ccg
Coding regions
length per megabase0.9320.84945 / 9 aac, aag, aat, act
number per megabase0.9970.15445 / 9 aac, aag, aat, act
Introns
length per megabase0.9251.39656 / 9 act, agc, ccg
number per megabase1.0000.09867 / 9 act, ccg
Intergenic regions
length per megabase0.9621.01056 / 9 act, agc, ccg
number per megabase1.0000.09967 / 9 act, ccg