Abundance of perfect and imperfect trinucleotide repeats in chimpanzee chromosome 16

ALL         perfect                          imperfect
Repeat      TL      TN     LPMb     NPMb     TL      TN     LPMb     NPMb
aac         14877   949    190.62   12.16    23041   929    295.22   11.90
aag         4854    313    62.19    4.01     10935   271    140.11   3.47
aat         24459   1533   313.39   19.64    37021   1480   474.34   18.96
acc         3966    297    50.81    3.81     7694    280    98.58    3.59
acg         24      2      0.31     0.03     33      2      0.42     0.03
act         711     49     9.11     0.63     1763    43     22.59    0.55
agc         4539    328    58.16    4.20     6360    316    81.49    4.05
agg         7176    546    91.94    7.00     17376   489    222.63   6.26
atc         5196    355    66.58    4.55     14376   295    184.20   3.78
ccg         2067    137    26.48    1.75     5446    127    69.78    1.63

CODING      perfect                          imperfect
Repeat      TL      TN     LPMb     NPMb     TL      TN     LPMb     NPMb
aac         0       0      0.00     0.00     0       0      0.00     0.00
aag         102     8      80.81    6.34     141     8      111.70   6.34
aat         0       0      0.00     0.00     0       0      0.00     0.00
acc         129     9      102.20   7.13     217     9      171.91   7.13
acg         12      1      9.51     0.79     12      1      9.51     0.79
act         12      1      9.51     0.79     12      1      9.51     0.79
agc         642     43     508.61   34.07    1312    37     1039.40  29.31
agg         471     35     373.14   27.73    1124    33     890.47   26.14
atc         27      2      21.39    1.58     36      2      28.52    1.58
ccg         354     26     280.45   20.60    739     23     585.46   18.22

INTRON      perfect                          imperfect
Repeat      TL      TN     LPMb     NPMb     TL      TN     LPMb     NPMb
aac         4314    280    189.26   12.28    6447    275    282.83   12.06
aag         1275    81     55.94    3.55     3242    71     142.23   3.12
aat         6972    426    305.86   18.69    9965    408    437.17   17.90
acc         1425    101    62.52    4.43     2464    92     108.10   4.04
acg         12      1      0.53     0.04     21      1      0.92     0.04
act         177     12     7.76     0.53     538     11     23.60    0.48
agc         1212    93     53.17    4.08     1554    90     68.17    3.95
agg         1797    139    78.83    6.10     3409    137    149.55   6.01
atc         1086    65     47.64    2.85     2558    58     112.22   2.54
ccg         480     30     21.06    1.32     1334    28     58.52    1.23

INTERGENIC  perfect                          imperfect
Repeat      TL      TN     LPMb     NPMb     TL      TN     LPMb     NPMb
aac         9855    621    182.53   11.50    15496   608    287.01   11.26
aag         3261    210    60.40    3.89     6851    180    126.89   3.33
aat         16206   1023   300.16   18.95    25201   992    466.77   18.37
acc         2187    170    40.51    3.15     4678    162    86.64    3.00
acg         0       0      0.00     0.00     0       0      0.00     0.00
act         495     34     9.17     0.63     1150    29     21.30    0.54
agc         2403    171    44.51    3.17     3100    168    57.42    3.11
agg         4509    341    83.52    6.32     12178   290    225.56   5.37
atc         3924    277    72.68    5.13     11488   225    212.78   4.17
ccg         1032    66     19.11    1.22     2759    61     51.10    1.13
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
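
As a minimal sketch of how the per-megabase columns relate to the raw totals, the snippet below scales a total length or count by the size of the region in megabases. The region length used here is an assumption: it is not stated in the source and was back-calculated from the aac row for all sequences (TL 14877, LPMb 190.62). The helper name `per_megabase` is hypothetical.

```python
def per_megabase(value, region_length_bp):
    """Scale a total length (TL) or a count (TN) to a density per megabase (10**6 bp)."""
    return value / (region_length_bp / 1_000_000)


# ASSUMPTION: about 78.05 Mb of analysed sequence, inferred from TL / LPMb
# for the aac row; the source does not give this figure.
region_length_bp = 78_050_000

lpmb = per_megabase(14877, region_length_bp)  # perfect aac, all sequences: TL
npmb = per_megabase(949, region_length_bp)    # perfect aac, all sequences: TN

print(f"LPMb = {lpmb:.2f}, NPMb = {npmb:.2f}")
```

Under that assumed length, the results land close to the tabulated 190.62 and 12.16; small differences come from rounding in the inferred region size.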

Statistical analysis (contingency test results)

Data points with the smallest values were excluded iteratively until every remaining data point contributed more than 5% of the total. Data sets were then normalized to 100%.
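
The filtering and normalization steps can be sketched roughly as follows, using the LPMb values for all sequences from the table above. Two details are assumptions rather than statements from the source: a repeat class is dropped when its share falls to 5% or less in either the perfect or the imperfect data set, and the test statistic is the usual Pearson chi-square for a 2 x k contingency table built from the two normalized rows. The function names are hypothetical.

```python
# LPMb values for all sequences, copied from the table (perfect vs imperfect repeats).
perfect = {"aac": 190.62, "aag": 62.19, "aat": 313.39, "acc": 50.81, "acg": 0.31,
           "act": 9.11, "agc": 58.16, "agg": 91.94, "atc": 66.58, "ccg": 26.48}
imperfect = {"aac": 295.22, "aag": 140.11, "aat": 474.34, "acc": 98.58, "acg": 0.42,
             "act": 22.59, "agc": 81.49, "agg": 222.63, "atc": 184.20, "ccg": 69.78}


def percentages(data, keys):
    """Normalize the selected values so that they sum to 100%."""
    total = sum(data[k] for k in keys)
    return {k: 100.0 * data[k] / total for k in keys}


def filter_and_test(a, b, threshold=5.0):
    """Drop the smallest class until every share exceeds `threshold` percent
    in both data sets (ASSUMED rule), then compute a 2 x k chi-square statistic."""
    kept = sorted(a)
    excluded = []
    while len(kept) > 1:
        pa, pb = percentages(a, kept), percentages(b, kept)
        smallest = min(kept, key=lambda k: min(pa[k], pb[k]))
        if min(pa[smallest], pb[smallest]) > threshold:
            break
        kept.remove(smallest)
        excluded.append(smallest)
    pa, pb = percentages(a, kept), percentages(b, kept)
    chi2 = 0.0
    for k in kept:
        expected = (pa[k] + pb[k]) / 2.0  # both rows sum to 100, so expected is the mean
        chi2 += (pa[k] - expected) ** 2 / expected + (pb[k] - expected) ** 2 / expected
    return chi2, len(kept) - 1, sorted(excluded)


chi2, dof, excluded = filter_and_test(perfect, imperfect)
print(f"chi-square = {chi2:.3f}, df = {dof}, excluded = {excluded}")
```

With these inputs the sketch excludes acg, act and ccg and yields a chi-square of about 2.65 with 6 degrees of freedom, which is consistent with the values reported for all sequences (length per megabase). A probability could then be read from the chi-square distribution, for example with `scipy.stats.chi2.sf(chi2, dof)`.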

Feature               Probability  ChiSquare  Degrees of freedom  Used / all data points  Excluded repeats
All sequences
length per megabase   0.852        2.648      6                   7 / 10                  acg, act, ccg
number per megabase   1.000        0.130      6                   7 / 10                  acg, act, ccg
Coding regions
length per megabase   0.932        0.440      3                   4 / 10                  aac, aag, aat, acg, act, atc
number per megabase   0.997        0.147      4                   5 / 10                  aac, aat, acg, act, atc
Introns
length per megabase   0.915        2.049      6                   7 / 10                  acg, act, ccg
number per megabase   1.000        0.062      6                   7 / 10                  acg, act, ccg
Intergenic regions
length per megabase   0.672        3.180      5                   6 / 10                  acg, act, agc, ccg
number per megabase   1.000        0.227      6                   7 / 10                  acg, act, ccg