Abundance of perfect and imperfect trinucleotide repeats in chimpanzee chromosome 11

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 23379 1522165.8310.80 34761 1490246.5710.57 24 212.241.02 30 215.301.02 7296 469161.7210.40 10805 466239.5010.33 14868 975158.3310.38 22175 946236.1510.07
aag 6651 44847.183.18 14307 404101.482.87 321 26163.7613.26 437 26222.9413.26 1992 13744.153.04 4086 13090.572.88 4044 26443.072.81 9231 22898.302.43
aat 38133 2514270.4917.83 57081 2462404.8917.46 0 00.000.00 0 00.000.00 12918 825286.3318.29 18653 809413.4517.93 22761 1524242.3916.23 34749 1494370.0515.91
acc 6234 46644.223.31 10769 43976.393.11 246 19125.509.69 451 18230.089.18 2103 15546.613.44 3364 15574.563.44 3357 25235.752.68 5736 22961.082.44
acg000.000.00000.000.00000.000.00000.000.00000.000.00000.000.00000.000.00000.000.00
act 1233 888.750.62 2029 8414.390.60 0 00.000.00 0 00.000.00 474 3510.510.78 833 3318.460.73 711 497.570.52 1124 4711.970.50
agc 6144 44643.583.16 7788 44155.243.13 540 40275.4920.41 837 40427.0020.41 2007 14744.493.26 2487 14355.123.17 3189 23233.962.47 3953 23142.102.46
agg 9726 73968.995.24 18204 692129.134.91 579 44295.3822.45 1027 43523.9321.94 2859 21963.374.85 5266 208116.724.61 5595 42259.584.49 10496 390111.784.15
atc 6915 47449.053.36 14081 44199.883.13 228 18116.329.18 379 18193.359.18 2232 15149.473.35 4162 14192.253.12 4047 27643.102.94 8958 25795.402.74
ccg 1992 13614.130.96 4787 13333.960.94 495 37252.5318.88 1249 36637.1918.37 402 268.910.58 1197 2626.530.58 906 599.650.63 1903 5720.270.61
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.9700.89856 / 9 act, agc, ccg
number per megabase1.0000.03567 / 9 act, ccg
Coding regions
length per megabase0.8382.08256 / 9 aac, aat, act
number per megabase1.0000.01556 / 9 aac, aat, act
Introns
length per megabase0.9910.83467 / 9 act, ccg
number per megabase1.0000.01967 / 9 act, ccg
Intergenic regions
length per megabase0.9491.15756 / 9 act, agc, ccg
number per megabase1.0000.06967 / 9 act, ccg