Abundance of perfect and imperfect trinucleotide repeats in chimpanzee chromosome 9

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 21483 1370186.5011.89 31122 1351270.1811.73 24 219.041.59 26 220.631.59 6369 394189.4711.72 9107 390270.9311.60 13977 905174.0311.27 20262 891252.2811.09
aag 5610 38148.703.31 12630 350109.643.04 105 883.316.35 156 8123.776.35 1524 10945.343.24 3346 10499.543.09 3735 24646.503.06 8666 221107.902.75
aat 32148 2105279.0818.27 47119 2040409.0517.71 0 00.000.00 0 00.000.00 9051 586269.2617.43 13164 566391.6216.84 21672 1424269.8317.73 31815 1383396.1217.22
acc 4881 36242.373.14 7701 35466.853.07 153 11121.398.73 270 11214.218.73 1602 11947.663.54 2450 11572.893.42 2787 20834.702.59 4545 20456.592.54
acg000.000.00000.000.00000.000.00000.000.00000.000.00000.000.00000.000.00000.000.00
act 969 698.410.60 1848 6816.040.59 12 19.520.79 12 19.520.79 420 3012.490.89 717 3021.330.89 468 355.830.44 1029 3412.810.42
agc 4746 35341.203.06 6270 34554.433.00 375 29297.5223.01 634 29503.0023.01 1614 12348.023.66 2068 12261.523.63 2412 17430.032.17 3049 17137.962.13
agg 6717 51058.314.43 14096 462122.374.01 216 17171.3713.49 301 17238.8113.49 1980 15558.904.61 3911 143116.354.25 4179 31352.033.90 9332 280116.193.49
atc 5160 33644.802.92 9070 32578.742.82 132 9104.737.14 192 9152.337.14 1665 10449.533.09 2765 9982.262.94 3135 20739.032.58 5745 20171.532.50
ccg 1953 13016.951.13 5350 12346.451.07 297 22235.6317.45 551 21437.1516.66 645 3919.191.16 1899 3856.491.13 897 6111.170.76 2673 5633.280.70
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.9451.19956 / 9 act, agc, ccg
number per megabase1.0000.03467 / 9 act, ccg
Coding regions
length per megabase0.9900.54856 / 9 aac, aat, act
number per megabase1.0000.01956 / 9 aac, aat, act
Introns
length per megabase0.9851.02367 / 9 act, ccg
number per megabase1.0000.02167 / 9 act, ccg
Intergenic regions
length per megabase0.9131.50556 / 9 act, agc, ccg
number per megabase1.0000.05067 / 9 act, ccg