CI01000026_09900193_09906874 (SALL1A, SALL1)



Basic Information


Item Value
gene id CI01000026_09900193_09906874
gene name SALL1A, SALL1
gene type coding
species grasscarp (Ctenopharyngodon idella)
category of species economic fish

Chromosome Information


Item Value
chromosome id CI01000026
NCBI id null
chromosome length 17172400
location 9900193 ~ 9907237 (+)
genome version v1_2015/06_NatureGenetics_grasscarp_Genome

Sequence


>CI01000026_09900193_09906874.mRNA
ATGATTTTTGAGGGACAAAAAGAGAGGAGGTGGGGGGACACAGAGACCAACCCGGACGTCTCCCCCTGTAAAGACTCGGGTGCCCATGTCTGCAGCAGATGTTGCGCAGAGTTCTTTGAACTATCGGACCTTGAAAAACACCAGAAGAATTGCACTAAGAATCAATTAGTTCTAATTGTGAATGAAAATCCAGTGTCTCCTTCCGGAACCTTCTCCCCAGGATCCTCTCCTCATAATCCTGATGAGCAGATGAATGATACAACTAATAACACAGATCAAGCAGAGTTGACCGACCTTTTGGAGCAGAACATACTTGACAAAGAGGAGGCCATGGACACAGAGATGTCAGGAGTCTCTGCACCCCATAATGATGGAAGTGGATGTATGGCAGGCGGCAGCCCACTCAGTGGTATTGGAAGCAGCCATGGAGCTCTGGGCAGCTCTGGGTCCAGTATGGGAAACTCTGTCATCTCTGCCTCACTACCTCAGCTGGGTAACTTGACTGAGTTGGGCAATTTTTCCATGATCAACAGCAACGTCATTATTGAAAACCTACAGAGCACCAAAGTGGCTGTGGCCCAGTTCTCTCAAGAGACCAGAGCAGCGGGAGGGCAAAGAGTGGCTGTCCCAGCCCTCATGGAGCAGCTCTTGGCCCTACAGCAGCAGCAGATCCATCAGCTACAGCTCATTGAGCAGATTCGTCACCAGATTCTTCTGCTAGCGTCACAGTCACCTGAGCTCCAAGCACCACCTAGTTCTTCCCCTGTGACTTTAGGTTCAGGTACTAGCCCGCTAACCACTCTTAGTTCCCATTTGTCTCAACAGTTGGCAGCTGCTGCAGGGTTGGCACAGAGCATAGCTAGCCAGTCTGCAAGCATTAGCAGTTTGAAACAGATGGCTAATGTTGCGCAGCTACCTCAGAGCAATTCCAGTGTGGGAGAGTCATCTCAAAGCCTTGGCACACCAGGGCCAGCTACAGTTAATGCCCAACCTTCAGACAAGAGGCCAGGCAATACAGGAAGCTTGCATCCTCAGTTAGGAAACCCATCACTTGCAAAGTCATCCACACCAGCCTTTGCAATGGGAAGCCTTTTGAATCCTGTGGCTAATACCCTTCTACCTCAGCCCCCACCAGGCAACCCAATATTTTCCAGTGCACTGCCCAGTGTTGGTACTACAGTTGAGGATCTCAACTCACTGGCTGCACTGGCTCAGCAAAGAAAAGGCAAGCCTCCTAATGTATCATCGTTTGAACCGAAGAGCAGCTCTGAGGATACGTTTTTCAAACATAAGTGCAGGTTTTGTGGCAAGGTGTTTGGTAGTGACAGTGCATTGCAGATTCACTTACGTTCTCATACAGGTGAGAGGCCGTACAAGTGCAACATATGTGGCAACCGCTTTTCCACCCGTGGTAACTTGAAGGTTCATTTTCAGCGTCATAAAGAGAAGTATCCACACATTCAGATGAACCCATACCCTGTCCCAGAGCATCTAGACAATATTCCAACGAGTACTGGCATTCCCTATGGTATGTCTATGCCTCCTGAGAAGCCAGTGACTAGTTGGCTCGATAGTAAACCTGTTCTGTCTACTCTGACATCTTCTGTTGGCATGTTACTTCCACCAACTATACCAAGCCTGCCTCCATTCATCAAGAAGGAAGAGAATAACTCAGTGGCTATAAGCAGCCCTTCTCATTCTGCTAAAAGCGACTCAGGTCCTGCTGATACACCTATGAAGAACACAGATAGTGTGTTGGAAGAAGGTGAGTGCACAACCCTGCCTACCTCAAATGGAAAAGCTGAAGAAAACAATCAGTCCTCCAGTTTAACAACAAACATGAGCTCTGCAGTGGAGGGTACCATTGAGTACACCACCTCCAACAGCCCTCCAATGGCCACTAACCCACTCATGCCTCTGATGTCAGAACAGTTCAAAGCTAAGTTTCCATTCGGGGGTCTCCTTGATCCTCTCCAAGGCTCAGAAACTTCTAAGCTTCAGCAACTTGTTGAGAATATTGACAGGAAGGTGACAGACCCCAATGAGTGTGTTATCTGCCACCGTGTTCTTAGTTGCCAGAGTGCACTGAAAATGCACTACCGCACACACACTGGGGAGAGGCCTTTCAAATGTAAAGTGTGTGGTCGTGCCTTCACCACCAAGGGCAACCTCAAGACTCATTACAGTGTCCATCGAGCTATGCCGCCACTGAGGGTCCAGCATTCTTGTCCTATTTGCCAGAAGAAATTCACGAATGCTGTGGTACTGCAGCAGCATATTCGCATGCACATGGGTGGCCAGATCCCCAACACCCCTCTTCCTGACAACTATCCAGAATCCATGGGGTCCGATGCTGGCTCATTTGACGAGAGAAACTTTGATGATCTTGACAACTTCTCTGATGAAAACATGGAAGGAATTGAGGATGGACCAGACAGCAGCATTCCAGACACACCAAAGTCAGCTGATGCATCTCAAGACAGCGTCTGTTCATCCCCTACTCCACAGGAGTTATTAGGTATGGAGAGCCAGAAGTGTGCCAACCAGGGAGCAGGAGAGGAAGTGCAGTGTGACCAAGGGAAATCATTGGAGAACGGATTAATGGAAGGAGACAGACTCACAAATGACTCCTCTTCACTTGGAGGTGACATCGAAAGCCAAAGTGCCGGAAGTCCAGCTGTCTCTGAATCTACCTCTTCCATGCAAGCTCCATCACCCTCTAATGGTACGATCCAACAGCAAAAGTCTCCTAGTCTGGAGGATCGGCAGCAGAGGGCATTATCATTGGAGCACATTAGTGCAGGTCTCATGCAGTCTCATTCTGCCAGTCCTGGAGCTCTGGATCTTACTTCAATTAACTCATCAAAAGATCCTCTTAGCATGCTTTTCCCCTTCCGTGAGCGAGGCACCCTGAAGAATACAGCATGTGACATCTGTGGCAAGACATTTGCCTGCCAGAGTGCCTTGGACATCCATTACCGTAGCCATACCAAAGAGAGACCGTTCATTTGCACAGTATGCAACCGGGGCTTTTCCACCAAGGGGAACTTGAAGCAACATATGCTGACCCATCAAATGCGAGATTTGCCCTCCCAGCTCTTCGAACCCAACACCAGTCTTGCCTCTAGCCCAACTCCATCCCTTCTGTCTGTTGGACCTTTGACCTCCATGATGAAGACTGAGGTCAACGGCTTTCTGCATGGTCTTCAACCTGAAGTAAAAGACTTGCCCTCTGCACTAGTGACCTCATCTGCTTCCACCTCTCCAGTACTTTCGACTGCCCCACCACGGAGGACGCCTAAACAGCACTACTGTACCACTTGTGGGAAGACATTTTCCTCCTCCAGTGCCCTGCAAATCCATGAGAGAACCCATACGGGAGAGAAGCCCTTTGCTTGTACCATTTGTGGACGAGCATTTACCACCAAAGGAAATCTCAAGGTGCATATGGGTACCCACATGTGGAACAGTGCTCCTGCACGGCGAGGTCGCAGGCTTTCAGTAGATGGTCCCATGGCTTTCCTGGGCACCAACCCTGTAAAGTTTCCCGAGCTCTTCCAAAAGGACCTAACCAGTAGAGCTGGAAACGGAGACCCAACCAGCTTCTGGAATCAGTATGCTGCTGCTTTCTCTAACGGCCTGGCCATGAAAACCAACGAGATCTCTGTGATCCAGAATGGAGGTCTGCCCCCTCCACTGTCAGGGAGTGTGGGCAATGGGGGCAGTTCTCCGATCGGAGGCTTGACGGGCAGCATGGAAAAGCTGCATAACTCTGAGCCCAATCCTGCTCTGGCTGGCCTAGAGAAAATGGCCAACACAGAGAATGGGACTCACTTCCGCTTCACACGCTTCATGGAGGACAAGGAGATTGTGACCAACTAGAGAATCGTTTTTGTTGTGCTCATGAACAAACATACACTCTTTAACTGGCACACACAAGTGCTCCGTATATGTGTCCATGAAAGAGAAAAGCAAGAAAAAACTGAATCAGCTTTTGCTTGCCATTAAGACTTTATGAAGAGGAGTTTGAAAAGAACTTGCTTTTTTTGTACAATGTACAACACGTTATAGTTTTTGGAGCTTATTTATTAGTGATTATAACCTTGCTCGAAAACCAAGTGGAAATATTAACAGTTACTTAGTTTTTATGTTATTTTCTGTTTATCAGCAGAGTGAATCTGTATGAGTGTTTTAAAGTGTGCTTTTGAAACTTAGCTTTTCTTGGTAGAAGACTTCACCATGGCA

Function


symbol description
sall1a Predicted to enable DNA-binding transcription factor activity, RNA polymerase II-specific and RNA polymerase II cis-regulatory region sequence-specific DNA binding activity. Acts upstream of or within embryonic pectoral fin morphogenesis and positive regulation of fibroblast growth factor receptor signaling pathway. Predicted to be active in nucleus. Is expressed in several structures, including fin; nervous system; neural keel; neural tube; and tail bud. Human ortholog(s) of this gene implicated in Townes-Brocks syndrome and middle lobe syndrome. Orthologous to human SALL1 (spalt like transcription factor 1).
sall1 Enables beta-catenin binding activity. Involved in several processes, including animal organ development; embryonic digit morphogenesis; and regulation of transcription by RNA polymerase II. Located in several cellular components, including chromocenter; heterochromatin; and nucleus. Implicated in Townes-Brocks syndrome and middle lobe syndrome. Biomarker of hepatoblastoma.

GO:

id name namespace
GO:0043565 sequence-specific DNA binding molecular_function

KEGG:

id description
K19871 SALL; sal-like protein

RNA


RNA id representative length rna type GC content exon number start site end site
CI01000026_09900193_09906874.mRNA True 4245 mRNA 0.48 3 9900193 9907237

Neighbor


gene id symbol gene type direction distance location
CI01000026_09666763_09669325 NA coding upstream 230491 9666680 ~ 9669702 (+)
CI01000026_09602263_09635635 TOX3 coding upstream 263199 9601643 ~ 9636994 (+)
CI01000026_09365911_09428712 CHD9 coding upstream 471117 9364907 ~ 9429076 (+)
CI01000026_09339450_09352303 AKTIP coding upstream 547016 9339450 ~ 9353177 (+)
CI01000026_09175446_09178464 NA coding upstream 721705 9175249 ~ 9178488 (+)
CI01000026_09915483_09917692 NA coding downstream 8208 9915445 ~ 9917919 (+)
CI01000026_10124232_10124468 NA coding downstream 216679 10123916 ~ 10125835 (+)
CI01000026_10136110_10149386 BRD7 coding downstream 228759 10135996 ~ 10149466 (+)
CI01000026_10200749_10202405 NA coding downstream 291356 10198593 ~ 10202514 (+)
CI01000026_10265012_10268157 NA coding downstream 357708 10264945 ~ 10268548 (+)
G132916 NA non-coding upstream 50490 9849234 ~ 9849703 (+)
G132908 NA non-coding upstream 60929 9839015 ~ 9839264 (+)
G132888 NA non-coding upstream 82228 9817686 ~ 9817965 (+)
G132879 NA non-coding upstream 96593 9803342 ~ 9803600 (+)
G132868 NA non-coding upstream 112115 9787856 ~ 9788078 (+)
G132800 NA non-coding downstream 12149 9919386 ~ 9920088 (+)
G132953 NA non-coding downstream 23248 9930485 ~ 9930713 (+)
G132951 NA non-coding downstream 71236 9978473 ~ 9979099 (+)
G132950 NA non-coding downstream 75790 9983027 ~ 9983920 (+)
G132949 NA non-coding downstream 80902 9988139 ~ 9988663 (+)
G132667 NA other upstream 331033 9568739 ~ 9569160 (+)
G131077 NA other upstream 1393406 8501408 ~ 8506787 (+)
CI01000026_08297901_08307596 CTRB1 other upstream 1600779 8297857 ~ 8307629 (+)
G130967 NA other upstream 1711120 8188374 ~ 8189073 (+)
G130552 NA other upstream 2803541 7095053 ~ 7096652 (+)
G133192 NA other downstream 252582 10159819 ~ 10161961 (+)
CI01000026_11075931_11078249 MDK, MDKA other downstream 1163531 11075784 ~ 11079080 (+)
CI01000026_11241496_11246081 NA other downstream 1337696 11241496 ~ 11246204 (+)
G133542 NA other downstream 1568880 11476117 ~ 11477544 (+)

Expression



Co-expression Network


Homologous


species gene id symbol gene type chromosome NCBI id location
zebrafish (Danio rerio) XLOC_033904 sall1a coding NC_007118.7 CM002891.2 37359004 ~ 37392172 (+)
bowfin (Amia calva) AMCG00020306 sall1,LOC106587587 coding CM030127.1 CM030127.1 15760085 ~ 15770745 (+)