G338841 (col1a1,col1a1a,LOC107585052,LOC107740634,LOC108279263)
Chromosome Information
| Item |
Value |
| chromosome id |
NW_019172936.1 |
| NCBI id |
APWO02000145.1
|
| chromosome length |
1987809 |
| location |
1878449 ~ 1955333 (+) |
| genome version |
Astyanax_mexicanus-2.0_2017_mexican_tetra_Genome |
Sequence
>TU466298
ACAAGGGGAAAAggGGCACTGCCATTGAAGTAACCATTGAACAACACGAGGGGGACTTGGTAAAGAAACCAAAAAAGACAACAACCAAAAAGTGTCAAAACTCAAATCTGTTGGGGTGAAAGACTCCACCTGCTTATTGCTACACTTGAAAAGACTATGTGCATTAAAGCTGCGTTGCTATAAAGCCCTGCTGCAAAAAAATGCAGCAAAAACACATTAGCAAGAGAGGCATTTGTTACAACTGGGCAGCAAGATCATttgttttttgcctttttttagcACCAAAGGCCCTGTGAGCAAGAGTGACAAACAATACATTactttttcaattaaaaaagcacAACTCTAACACACCCACTTACAGGACCTCATGATATAGCACCTTTTGCAGCTGTTACTTGCTTCTTTTAAACAATACTTCCGCAATGTAAAATCCTAACCTGGTGTTGGTCAACACTGGGTAATAACGTTGGTTTCTTTTCTGAGGTAGCCATAAGATATCACAACAAAGTCAAACCCCTGTCTCTTATTAGTTCTGTTCTTTCTGCGATGCTCGAAAGGAACAATGATGCTTCGTGCCTGCTGGACACTTGTTCTGATGGGTAGAACACAAAGACACTTAAATATCATTTTACAGATTTATTCATCACCTGAACATGGTACAAAGCATTTGGTTCTTGTTTTCATTGTCCTTCCGCAGtggtaaaaaaaagtcaccTGTATCTGTATTCAGTTTTTTAGCTCCAGGGGGATTTTACGCTTTAcaagtatatattattttcttgtTGCCAGTGAAACAAATGTCTCCTCGGTGTTGGTGGTGGTAAAATAGTGGTCCCAAAAAAAAGCTTGATAACACACAACCACTGTCTTCTTAAGCACAGTTCTAAGCAAGTGGACCATTCGGACAGCCAAATCGGACTCtagggctttttttttaattgcatagAAAAGATTTTAGAGAGATGACTgctagaaaaaacacacacacatcacacaaacAATTTCAAGtccagatttctttttttacaagaaGCAGACTGGGCCAACTTCAATGCCAAATTCCTGATTGGGTGCACCAACGTCCATAGGAGCGATGTCAATAATAGGCAGCCGAGATGTTTTCGTTGTTTTGTAGTCGATGACTGTCTTGCCCCATGCACCAGTGTGTGACTGCAAgaattgaaaaaagaaaaagtacaaTT
>TU466303
CATGACTATATAAGAGCTCTTATATTACGCACCGTGCAACCATCCTCAGTGACGCTGTATGTGAAGCGGCTGTTGCCCTCTGCTCTGATCTCGATCTCATTGGATCCCTGGAGCAGCAGAGCCTTCTTAAGATTGCCAGAGGCCTGGTCCATGTAGGCAATGCTGTTCTTGCAGTGGTATGTGATGTTCTGGGAGGCCTCAGTGGACATGAGACGCAGGAAGGTGAGCTGGATGTTGACATCCTCGGGCTTGGATTCCTCACCACCATATTCGAACTGGAAGCCATCGGTCATGGCCTCTCCGAACCAGACGTGCTTCTTCTCCTTGATGTTCTTGCTTGTGTACCAGTTCTTCTGAGGAATGTCGGACTGTGATGGGTAGACGCAGGTCTCGCCGGTCTCCATGTTACAGTAGACCTTGATGGCATCCTGGTTGCAGCCCTGGTCAGGGTCAATCCAGTGTCAGGGTGGCACATCTTCAGGTCGCGGCAGGTGCGGGCGGGGTTCTTCTTGGTGCCATCAGGGCTGCGGATGCTCTCAATCTGCTGGCTCAGGGACTTGAGGGTGGTGTCAACCTCCAGGTCACGGTCGCGCATCACATTGGCGTCATCAGCACGGAAGTGACGGAAGGGATCGGGTGCCTTCTCCTGTGGCTGGGCAATGAAGCCAATGTCGAATCCACCACCAGAGGGTCCAGGAGGTCCAGGGGGTCCGGGAGGTCCAGGAGGTCCCTAAAACAAAGGATGATGGAAGTAGGGGTGTAAAGTAGGTGCCTGATATATGGTCtgaacactttaaaatgagaagaagaaagagtATCTCACAGCAGGTCCAATTTCACCACTGCGACCGCGAGGTCCAGGGGGTCCAATGGGTCCAGGAAGTCCACTCATGCCATCCTTACCAGGGGTACCAGCAGATCCAGAAGGTCCCTATTGTTGTACAGAAGGAAACAATGACTTTATCTTCACTCAGCTTCAGGCAAGTGTCTCCTGGTATTGATATTGGTGCCAATTTATTATCCATGAAAAGgaatagaagaagaaagagaaatactTACTCTAGGTCCAGCAGGGCCAGATGATCCAGCGGGTCCAGACTCTCCAGAAGGTCCAGCCAGTCCAGGGGGTCCCATGGGTCCAGGAGGTCCACGCTCGCCACCGGGTCCACCAGGTCCCTGCTTGCCAGGCTCTCCCTAGTAGAGGAAAGAGGTTGGTTGGAT
>TU466304
AGCCATTTCTGATACTTTCTTTAATCTTCCTTTTTTAGTCCAAACAACTGTAACAGGTGCAAGTTGTGCTTCATTCAAGTCCATTGGGCAATCAGGGATGAGTCAAGATCACTTCTGACAAACATAAGGTCGACCACTTAACATGCTTTATTTCAGcattcaaaataattaaaaacatctcaaattattatacatatacaaaatagGTACAGATCCTCTTGTTTTAACCCCCCTATTGTGTGTCTGCTTTCTCACTAGTGTTTAGTTGTTGGAATTCTCTATTTCTCTTGCTATTCTCTTTGGAAAACCTTAAAAACCCttgtcaaaaaaacaaaacaaaaacaacaaattccaagaaaaggagaaaagggGGCACTGCCATTGAAGTAACCATTGAACAACACGAGGGGGACTTGGTAAAGAAACCAAAAAAGACAACAACCAAAAAGTGTCAAAACTCAAATCTGTTGGGGTGAAAGACTCCACCTGCTTATTGCTACACTTGAAAAGACTATGTGCATTAAAGCTGCGTTGCTATAAAGCCCTGCTGCAAAAAAATGCAGCAAAAACACATTAGCAAGAGAGGCATTTGTTACAACTGGGCAGCAAGATCATttgttttttgcctttttttagcACCAAAGGCCCTGTGAGCAAGAGTGACAAACAATACATTactttttcaattaaaaaagcacAACTCTAACACACCCACTTACAGGACCTCATGATATAGCACCTTTTGCAGCTGTTACTTGCTTCTTTTAAACAATACTTCCGCAATGTAAAATCCTAACCTGGTGTTGGTCAACACTGGGTAATAACGTTGGTTTCTTTTCTGAGGTAGCCATAAGATATCACAACAAAGTCAAACCCCTGTCTCTTATTAGTTCTGTTCTTTCTGCGATGCTCGAAAGGAACAATGATGCTTCGTGCCTGCTGGACACTTGTTCTGATGGGTAGAACACAAAGACACTTAAATATCATTTTACAGATTTATTCATCACCTGAACATGGTACAAAGCATTTGGTTCTTGTTTTCATTGTCCTTCCGCAGtggtaaaaaaaagtcaccTGTATCTGTATTCAGTTTTTTAGCTCCAGGGGGATTTTACGCTTTAcaagtatatattattttcttgtTGCCAGTGAAACAAATGTCTCCTCGGTGTTGGTGGTGGTAAAATAGTGGTCCCAAAAAAAAGCTTGATAACACACAACCACTGTCTTCTTAAGCACAGTTCTAAGCAAGTGGACCATTCGGACAGCCAAATCGGACTCtagggctttttttttaattgcatagAAAAGATTTTAGAGAGATGACTgctagaaaaaacacacacacatcacacaaacAATTTCAAGtccagatttctttttttacaagaaGCAGACTGGGCCAACTTCAATGCCAAATTCCTGATTGGGTGCACCAACGTCCATAGGAGCGATGTCAATAATAGGCAGCCGAGATGTTTTCGTTGTTTTGTAGTCGATGACTGTCTTGCCCCATGCACCAGTGTGTGACTGCAACCATCCTCAGTGACGCTGTATGTGAAGCGGCTGTTGCCCTCTGCTCTGATCTCGATCTCATTGGATCCCTGGAGCAGCAGAGCCTTCTTAAGATTGCCAGAGGCCTGGTCCATGTAGGCAATGCTGTTCTTGCAGTGGTATGTGATGTTCTGGGAGGCCTCAGTGGACATGAGACGCAGGAAGGTGAGCTGGATGTTGACATCCTCGGGCTTGGATTCCTCACCACCATATTCGAACTGGAATGTCGGACTGTGATGGGTAGACGCAGGTCTCGCCGGTCTCCATGTTACAGTAGACCTTGATGGCATCCTGGTTGCAGCCCTGGTCAGGGTCAATCCAGTACTCTCCTCTTCCAGTCAGGGTGGCACATCTTCAGGTCGCGGCAGGTGCGGGCGGGGTTCTTCTTGGTGCCATCAGGGCTGCGGATGCTCTCAATCTGCTGGCTCAGGGACTTGAGGGTGGTGTCAACCTCCAGGTCACGGTCGCGCATCACATTGGCGTCATCAGCACGGAAGTGACGGAAGGGATCGGGTGCCTTCTCCTGTGGCTGGGCAATGAAGCCAATGTCGAATCCACCACCAGAGGGTCCAGGAGGTCCAGGGGGTCCAGGGGGTCCCTGCATTCCAGTGAATCCTCTGTGTCCCTTCATGCCTCTTTCTCCAGCCTCTCCAGTCTCACCCTTGTCTCCACGGGCTCCAGCAGGTCCCTGTA
Function
| symbol |
description |
|
col1a1a
|
Predicted to be an extracellular matrix structural constituent. Acts upstream of or within several processes, including bone mineralization involved in bone maturation; fin development; and fin regeneration. Located in cytoplasm. Is expressed in several structures, including head; integument; myotome; pectoral fin; and vertebra. Used to study osteogenesis imperfecta and osteogenesis imperfecta type 3. Human ortholog(s) of this gene implicated in several diseases, including Ehlers-Danlos syndrome arthrochalasia type 1; aggressive periodontitis; bone disease (multiple); cutaneous leishmaniasis; and dentinogenesis imperfecta. Orthologous to human COL1A1 (collagen type I alpha 1 chain).
|
|
col1a1
|
Enables identical protein binding activity; platelet-derived growth factor binding activity; and protease binding activity. Involved in several processes, including animal organ morphogenesis; collagen biosynthetic process; and collagen fibril organization. Acts upstream of or within skeletal system development. Located in extracellular space. Part of collagen type I trimer. Colocalizes with collagen-containing extracellular matrix. Implicated in several diseases, including Ehlers-Danlos syndrome arthrochalasia type 1; aggressive periodontitis; bone disease (multiple); cutaneous leishmaniasis; and dentinogenesis imperfecta. Biomarker of several diseases, including alcoholic hepatitis; autoimmune disease (multiple); chronic obstructive pulmonary disease; end stage renal disease; and lupus nephritis.
|
NR:
| description |
| PREDICTED: collagen alpha-1(I) chain isoform X2 |
GO: NA
KEGG: NA
RNA
| RNA id |
representative |
length |
rna type |
GC content |
exon number |
start site |
end site |
| TU466298 |
False |
1205 |
lncRNA |
0.40 |
2 |
1878449 |
1952537 |
| TU466303 |
False |
1221 |
lncRNA |
0.54 |
4 |
1952614 |
1955333 |
| TU466304 |
True |
2247 |
TUCP |
0.45 |
5 |
1950976 |
1954313 |
Neighbor
| gene id |
symbol |
gene type |
direction |
distance |
location |
| rapgefl1 |
rapgefl1,LOC107662398,LOC107667751,LOC107740631,LOC107585050
|
coding |
upstream |
25597 |
1806923 ~ 1852852 (+) |
| casc3 |
casc3
|
coding |
upstream |
96451 |
1770290 ~ 1781998 (+) |
| LOC111192052 |
msl1
|
coding |
upstream |
110501 |
1763289 ~ 1767948 (+) |
| LOC103042716 |
LOC108412287
|
coding |
upstream |
143834 |
1727451 ~ 1734615 (+) |
| igf2bp1 |
igf2bp1,LOC108412303,LOC107717213,LOC107582884,LOC107667758,LOC106607494
|
coding |
upstream |
154331 |
1662780 ~ 1724118 (+) |
| G338826 |
NA
|
non-coding |
upstream |
18082 |
1854597 ~ 1860367 (+) |
| G338817 |
NA
|
non-coding |
upstream |
177373 |
1700784 ~ 1701076 (+) |
| G338762 |
NA
|
non-coding |
upstream |
271430 |
1606626 ~ 1607019 (+) |
| G338754 |
hoxb9,hoxb9a,LOC107585091,LOC107662432
|
non-coding |
upstream |
321818 |
1554515 ~ 1556631 (+) |
| G338748 |
hoxb7,hxb7a,hoxb7a,LOC107687793,LOC107582958,LOC107717221
|
non-coding |
upstream |
335435 |
1542137 ~ 1543014 (+) |
| LOC111192066 |
NA
|
other |
upstream |
861098 |
1012156 ~ 1017351 (+) |
| LOC107197228 |
NA
|
other |
upstream |
1110843 |
682779 ~ 767606 (+) |
| G338430 |
nupr1,LOC103367043,LOC108413333
|
other |
upstream |
1494195 |
379811 ~ 384254 (+) |
| atxn2l |
atxn2l
|
other |
upstream |
1558752 |
303017 ~ 319697 (+) |
| LOC111192057 |
zwi
|
other |
upstream |
1834933 |
36033 ~ 43516 (+) |
Expression