Abundance of perfect and imperfect trinucleotide repeats in human chromosome 18

ALLCODINGINTRONINTERGENIC
perfectimperfectperfectimperfectperfectimperfectperfectimperfect
TLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMbTLTNLPMbNPMb
aac 13779 923184.5612.36 20408 908273.3512.16 24 246.603.88 24 246.603.88 3681 243174.7511.54 5367 237254.7811.25 8859 597166.9011.25 13161 588247.9511.08
aag 4812 30964.454.14 10501 255140.653.42 36 369.895.83 53 3102.905.83 1467 9369.644.42 2656 69126.093.28 2949 18755.563.52 7212 157135.872.96
aat 21834 1375292.4518.42 31874 1335426.9317.88 0 00.000.00 0 00.000.00 5676 355269.4516.85 8056 342382.4416.24 14553 913274.1817.20 21439 888403.9116.73
acc 4023 29953.884.00 9736 219130.412.93 66 3128.145.83 96 3186.385.83 1707 12381.045.84 4171 68198.013.23 1830 14234.482.67 4895 11992.222.24
acg 45 30.600.04 62 30.830.04 0 00.000.00 0 00.000.00 18 10.850.05 32 11.520.05 15 10.280.02 15 10.280.02
act 738 499.880.66 1332 4717.840.63 0 00.000.00 0 00.000.00 177 118.400.52 319 1115.140.52 510 359.610.66 933 3317.580.62
agc 3333 24744.643.31 4262 24557.093.28 186 15361.1229.12 410 15796.0229.12 867 6041.162.85 1052 6049.942.85 1962 14836.962.79 2382 14644.882.75
agg 6318 47584.626.36 15302 388204.965.20 132 10256.2819.41 285 10553.3319.41 1938 14592.006.88 4729 122224.505.79 3669 27669.125.20 8768 227165.194.28
atc 3552 23947.583.20 6535 22287.532.97 60 5116.499.71 90 5174.749.71 864 5741.022.71 1569 5474.482.56 2268 15342.732.88 4352 14281.992.67
ccg 1260 8216.881.10 3649 7548.881.00 156 11302.8821.36 763 111481.3821.36 51 42.420.19 95 44.510.19 795 5014.980.94 2083 4339.240.81
TL: total length of repeats in class; TN: total number of repeats in class;
LPMb: length per megabase of repeats in class; NPMb: number of repeats in class per megabase

Statistical analysis (contingency test results)

Data points with the smallest value were exluded iteratively until all remaining data contributes more than 5% of the total. Data sets were normalized to 100%

FeatureProbabilityChiSquareDegrees of freedomused / all data pointsExcluded repeats
All sequences
length per megabase0.7982.35656 / 10 acg, act, agc, ccg
number per megabase0.9990.43167 / 10 acg, act, ccg
Coding regions
length per megabase0.0439.86845 / 10 aac, aag, aat, acg, act
number per megabase1.0000.00056 / 10 aac, aat, acg, act
Introns
length per megabase0.7522.66256 / 10 acg, act, agc, ccg
number per megabase0.9641.42467 / 10 acg, act, ccg
Intergenic regions
length per megabase0.7472.69656 / 10 acg, act, agc, ccg
number per megabase1.0000.23667 / 10 acg, act, ccg