Abundance of perfect and imperfect trinucleotide repeats in human chromosome 17

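ALL

| Repeat | TL (perfect) | TN (perfect) | LPMb (perfect) | NPMb (perfect) | TL (imperfect) | TN (imperfect) | LPMb (imperfect) | NPMb (imperfect) |
|---|---|---|---|---|---|---|---|---|
| aac | 16737 | 1070 | 210.85 | 13.48 | 25034 | 1049 | 315.38 | 13.21 |
| aag | 5352 | 319 | 67.42 | 4.02 | 11924 | 266 | 150.22 | 3.35 |
| aat | 26448 | 1580 | 333.19 | 19.91 | 38172 | 1528 | 480.89 | 19.25 |
| acc | 7698 | 527 | 96.98 | 6.64 | 21912 | 329 | 276.05 | 4.14 |
| acg | 66 | 5 | 0.83 | 0.06 | 78 | 5 | 0.98 | 0.06 |
| act | 681 | 50 | 8.58 | 0.63 | 1934 | 48 | 24.36 | 0.60 |
| agc | 5715 | 407 | 72.00 | 5.13 | 7723 | 399 | 97.30 | 5.03 |
| agg | 9951 | 748 | 125.36 | 9.42 | 23048 | 658 | 290.36 | 8.29 |
| atc | 4905 | 306 | 61.79 | 3.85 | 13238 | 246 | 166.77 | 3.10 |
| ccg | 3540 | 237 | 44.60 | 2.99 | 9477 | 227 | 119.39 | 2.86 |

CODING

| Repeat | TL (perfect) | TN (perfect) | LPMb (perfect) | NPMb (perfect) | TL (imperfect) | TN (imperfect) | LPMb (imperfect) | NPMb (imperfect) |
|---|---|---|---|---|---|---|---|---|
| aac | 36 | 3 | 18.82 | 1.57 | 36 | 3 | 18.82 | 1.57 |
| aag | 282 | 23 | 147.39 | 12.02 | 745 | 22 | 389.38 | 11.50 |
| aat | 0 | 0 | 0.00 | 0.00 | 0 | 0 | 0.00 | 0.00 |
| acc | 336 | 20 | 175.61 | 10.45 | 791 | 17 | 413.43 | 8.88 |
| acg | 39 | 3 | 20.38 | 1.57 | 48 | 3 | 25.09 | 1.57 |
| act | 12 | 1 | 6.27 | 0.52 | 12 | 1 | 6.27 | 0.52 |
| agc | 1209 | 83 | 631.90 | 43.38 | 1976 | 78 | 1032.78 | 40.77 |
| agg | 834 | 63 | 435.90 | 32.93 | 1651 | 55 | 862.91 | 28.75 |
| atc | 168 | 13 | 87.81 | 6.79 | 201 | 13 | 105.06 | 6.79 |
| ccg | 660 | 48 | 344.96 | 25.09 | 1715 | 46 | 896.37 | 24.04 |

INTRON

| Repeat | TL (perfect) | TN (perfect) | LPMb (perfect) | NPMb (perfect) | TL (imperfect) | TN (imperfect) | LPMb (imperfect) | NPMb (imperfect) |
|---|---|---|---|---|---|---|---|---|
| aac | 7287 | 462 | 223.07 | 14.14 | 10989 | 455 | 336.39 | 13.93 |
| aag | 1551 | 103 | 47.48 | 3.15 | 3335 | 90 | 102.09 | 2.75 |
| aat | 10764 | 629 | 329.50 | 19.25 | 15574 | 607 | 476.74 | 18.58 |
| acc | 3903 | 255 | 119.48 | 7.81 | 8246 | 139 | 252.42 | 4.25 |
| acg | 15 | 1 | 0.46 | 0.03 | 15 | 1 | 0.46 | 0.03 |
| act | 318 | 23 | 9.73 | 0.70 | 734 | 21 | 22.47 | 0.64 |
| agc | 1887 | 133 | 57.76 | 4.07 | 2329 | 131 | 71.29 | 4.01 |
| agg | 3549 | 272 | 108.64 | 8.33 | 7117 | 244 | 217.86 | 7.47 |
| atc | 1407 | 92 | 43.07 | 2.82 | 3325 | 83 | 101.78 | 2.54 |
| ccg | 303 | 22 | 9.28 | 0.67 | 683 | 22 | 20.91 | 0.67 |

INTERGENIC

| Repeat | TL (perfect) | TN (perfect) | LPMb (perfect) | NPMb (perfect) | TL (imperfect) | TN (imperfect) | LPMb (imperfect) | NPMb (imperfect) |
|---|---|---|---|---|---|---|---|---|
| aac | 8562 | 548 | 191.13 | 12.23 | 12758 | 536 | 284.80 | 11.96 |
| aag | 3297 | 179 | 73.60 | 4.00 | 7436 | 142 | 166.00 | 3.17 |
| aat | 14226 | 857 | 317.57 | 19.13 | 20514 | 827 | 457.94 | 18.46 |
| acc | 2922 | 222 | 65.23 | 4.96 | 11873 | 149 | 265.04 | 3.33 |
| acg | 12 | 1 | 0.27 | 0.02 | 15 | 1 | 0.34 | 0.02 |
| act | 327 | 24 | 7.30 | 0.54 | 1149 | 24 | 25.65 | 0.54 |
| agc | 2145 | 157 | 47.88 | 3.50 | 2680 | 156 | 59.83 | 3.48 |
| agg | 4833 | 359 | 107.89 | 8.01 | 12984 | 307 | 289.85 | 6.85 |
| atc | 2982 | 178 | 66.57 | 3.97 | 8253 | 132 | 184.23 | 2.95 |
| ccg | 1731 | 109 | 38.64 | 2.43 | 4749 | 102 | 106.01 | 2.28 |
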
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase
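
The per-megabase columns follow from the totals once the length of the analyzed sequence is known. The short Python sketch below illustrates the arithmetic for the aac row of the ALL columns. The sequence length is not stated in the table; the value used here is an assumption, back-calculated from that same row (16737 / 210.85), and the function name `per_megabase` is hypothetical.

```python
def per_megabase(value, region_length_bp):
    """Scale a total (bases in repeats, or number of repeats) to a per-megabase figure."""
    return value / (region_length_bp / 1_000_000)


# Assumption: length back-calculated from the aac row; it is not given in the table.
ASSUMED_ALL_LENGTH_BP = 79_380_000

lpmb = per_megabase(16737, ASSUMED_ALL_LENGTH_BP)  # from TL of perfect aac repeats
npmb = per_megabase(1070, ASSUMED_ALL_LENGTH_BP)   # from TN of perfect aac repeats
print(f"LPMb = {lpmb:.2f}, NPMb = {npmb:.2f}")
```

With this assumed length, both values agree with the tabulated aac entries to two decimals.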

Statistical analysis (contingency test results)

Data points with the smallest value were excluded iteratively until each remaining data point contributed more than 5% of the total. Data sets were normalized to 100%.

| Region | Feature | Probability | Chi-square | Degrees of freedom | Used / all data points | Excluded repeats |
|---|---|---|---|---|---|---|
| All sequences | length per megabase | 0.692 | 3.889 | 6 | 7 / 10 | acg, act, ccg |
| All sequences | number per megabase | 0.991 | 0.845 | 6 | 7 / 10 | acg, act, ccg |
| Coding regions | length per megabase | 0.759 | 1.873 | 4 | 5 / 10 | aac, aat, acg, act, atc |
| Coding regions | number per megabase | 1.000 | 0.106 | 5 | 6 / 10 | aac, aat, acg, act |
| Introns | length per megabase | 0.843 | 1.408 | 4 | 5 / 10 | acg, act, agc, atc, ccg |
| Introns | number per megabase | 0.906 | 1.558 | 5 | 6 / 10 | acg, act, atc, ccg |
| Intergenic regions | length per megabase | 0.250 | 6.621 | 5 | 6 / 10 | acg, act, agc, ccg |
| Intergenic regions | number per megabase | 0.994 | 0.719 | 6 | 7 / 10 | acg, act, ccg |
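
The exclusion, normalization and contingency test described above can be sketched in Python as follows, using the length-per-megabase values for all sequences (perfect versus imperfect repeats). This is an illustrative reading of the procedure, not the original analysis code. It assumes that a repeat class is dropped when its share falls to 5% or below in either data set, and that the chi-square statistic is computed on the normalized percentages. All function names are hypothetical.

```python
import math

# LPMb values for all sequences, copied from the abundance table.
PERFECT = {"aac": 210.85, "aag": 67.42, "aat": 333.19, "acc": 96.98, "acg": 0.83,
           "act": 8.58, "agc": 72.00, "agg": 125.36, "atc": 61.79, "ccg": 44.60}
IMPERFECT = {"aac": 315.38, "aag": 150.22, "aat": 480.89, "acc": 276.05, "acg": 0.98,
             "act": 24.36, "agc": 97.30, "agg": 290.36, "atc": 166.77, "ccg": 119.39}


def exclude_small_classes(a, b, min_percent=5.0):
    """Drop the class with the smallest share until every share exceeds min_percent.

    Assumption: a class is judged by the smaller of its shares in the two data sets.
    """
    kept, excluded = sorted(a), []
    while len(kept) > 2:
        total_a = sum(a[k] for k in kept)
        total_b = sum(b[k] for k in kept)
        share = {k: 100.0 * min(a[k] / total_a, b[k] / total_b) for k in kept}
        smallest = min(kept, key=share.get)
        if share[smallest] > min_percent:
            break
        kept.remove(smallest)
        excluded.append(smallest)
    return kept, sorted(excluded)


def normalize(data, kept):
    """Rescale the kept classes so that they sum to 100%."""
    total = sum(data[k] for k in kept)
    return [100.0 * data[k] / total for k in kept]


def chi_square_2xn(row_a, row_b):
    """Pearson chi-square statistic and degrees of freedom for a 2 x n table."""
    sum_a, sum_b = sum(row_a), sum(row_b)
    grand = sum_a + sum_b
    chi2 = 0.0
    for obs_a, obs_b in zip(row_a, row_b):
        column = obs_a + obs_b
        for obs, row_total in ((obs_a, sum_a), (obs_b, sum_b)):
            expected = row_total * column / grand
            chi2 += (obs - expected) ** 2 / expected
    return chi2, len(row_a) - 1


def chi2_sf(x, df):
    """Upper-tail chi-square probability for integer df, via the incomplete-gamma recurrence."""
    t = x / 2.0
    if df % 2 == 0:
        a, q = 1.0, math.exp(-t)
    else:
        a, q = 0.5, math.erfc(math.sqrt(t))
    while a < df / 2.0:
        q += t ** a * math.exp(-t) / math.gamma(a + 1.0)
        a += 1.0
    return q


kept, excluded = exclude_small_classes(PERFECT, IMPERFECT)
chi2, df = chi_square_2xn(normalize(PERFECT, kept), normalize(IMPERFECT, kept))
p_value = chi2_sf(chi2, df)
print(excluded, round(chi2, 3), df, round(p_value, 3))
```

Under these assumptions the sketch excludes acg, act and ccg and gives a chi-square of about 3.89 with 6 degrees of freedom (probability about 0.69), in line with the "All sequences, length per megabase" row. Small differences in the last digit are expected because the inputs are the rounded table values.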