Abundance of perfect and imperfect trinucleotide repeats in human chromosome 5

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 32319 2115181.8811.90 48033 2081270.3111.71 48 329.841.86 51 331.701.86 9525 616184.0211.90 13666 607264.0211.73 20529 1354165.1210.89 31037 1331249.6410.71
aag 9711 63454.653.57 20667 536116.313.02 288 23179.0314.30 417 23259.2314.30 1812 13435.012.59 3332 12664.372.43 7296 45358.683.64 16301 363131.112.92
aat 51957 3340292.3918.80 75688 3256425.9418.32 12 17.460.62 15 19.320.62 14487 924279.8917.85 20939 896404.5417.31 34482 2219277.3517.85 50419 2168405.5317.44
acc 8100 58945.583.31 18071 498101.702.80 246 18152.9211.19 380 17236.2210.57 2646 19251.123.71 6905 161133.403.11 4647 33837.382.72 9719 28278.172.27
acg 12 10.070.01 54 10.300.01 12 17.460.62 54 133.570.62 0 00.000.00 0 00.000.00 0 00.000.00 0 00.000.00
act 1986 13111.180.74 3390 11319.080.64 0 00.000.00 0 00.000.00 666 4612.870.89 1271 3424.550.66 1182 769.510.61 1894 7115.230.57
agc 6717 49937.802.81 8665 49248.762.77 612 45380.4427.97 1001 42622.2626.11 1929 14637.272.82 2481 14547.932.80 3825 28130.772.26 4636 27937.292.24
agg 10080 75956.734.27 19586 673110.223.79 324 24201.4114.92 657 23408.4214.30 2757 21153.274.08 5114 17798.803.42 6288 46950.583.77 12798 420102.943.38
atc 9369 62452.733.51 18027 577101.453.25 249 19154.7911.81 408 16253.639.95 2931 19056.633.67 5337 181103.113.50 5769 38646.403.10 11638 35293.612.83
ccg 2730 18215.361.02 8374 16547.120.93 771 53479.2932.95 2514 461562.8128.60 393 247.590.46 1003 1919.380.37 1050 708.450.56 3616 6529.080.52
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.9341.31356 / 10 acg, act, agc, ccg
number per megabase1.0000.14767 / 10 acg, act, ccg
Coding regions
length per megabase0.3475.60556 / 10 aac, aat, acg, act
number per megabase1.0000.15056 / 10 aac, aat, acg, act
Introns
length per megabase0.8831.74856 / 10 acg, act, agc, ccg
number per megabase1.0000.15467 / 10 acg, act, ccg
Intergenic regions
length per megabase0.9231.41556 / 10 acg, act, agc, ccg
number per megabase1.0000.21367 / 10 acg, act, ccg