CI01000029_06834812_06849277 (CPSF2)



Basic Information


Item Value
gene id CI01000029_06834812_06849277
gene name CPSF2
gene type coding
species grasscarp (Ctenopharyngodon idella)
category of species economic fish

Chromosome Information


Item Value
chromosome id CI01000029
NCBI id null
chromosome length 9486966
location 6833577 ~ 6849518 (+)
genome version v1_2015/06_NatureGenetics_grasscarp_Genome

Sequence


>CI01000029_06834812_06849277.mRNA
ATATAAAAACAAGTTTATTAATTAGATATATGTGAATTTAGAGTAATTTACCTTCTGCTAGTTTCCTCTGTGTATTATTAGCACAGCAGCGTTAGCAGGAAGGGTTAGAGGGCTTTGGACTGTACCAACTTTTCCTTTCCTCTAATGTTTCTGTGCATATTTATCAAATATATATTGTGCACATTTACATATGTCAAAAGTGAAGGTGATATAAAATCATAAAAAATAAATGTAAATTATAATAAATTATAAATTCTGCTATTATTTGGATGTTTTTGACAGTTTGAAAGGCTAAAGAATTATTACATCCAGCCAAAATCTTCTTTTTTTCTTCTTTTATTTATTTATTTTAGATTTTTCTTATCAAAAACTACATTGTAGAGACATAATTGACCTAACGTGTTACGTGTATCATAAAAATGGGCTATGTGTCGTTTACGGTACGAGACTTTAGTTAGTTATTTGTAAACGCCCCTCTAGTACTGTCAATGGTAGAGTCCAGTGACGTCGACAGCGGCAGTAATGGCTGCTGTTCTCATGCCGCCGATCTGATACAATAATATGAATTTAATATGATTTTTCAGCAATTTTCACCAGCTATTAGTGACCGAACCTGTTGAAATCAATAGTGATATTACAAGATACGGGTATGCAAGTGTTTATGTACATAAATTAAGTCAATTTTAGAAGAGAATCCCTTCACTTGTTTGGCTGCTATGTGTTTTCTTTGAGTTTAGATTAAGTTAAAATCTCAGATTTATTCTAATGTTTTGAAATTGTATAGTGTATGAAGGCTCATGAATACTCATATTCATTCATTCTTTTCCTCTCTCTTTCCTCTCCTGAGCTTACAGCTTTGTTCATTACATACTGCTCTGTTATTTTACCTGCATATTTATAAGTATTGTGCTGTTTTTTCTAATGTTTACAAAGCGTACTGTGGTGGTACTACGGTACAGTGCTGGTATCAGATACTAATATCATGGTATTTGGATAAATACCATTGTTATGTTTATGTATTCTTGCCATATTCATGTATCATTATATACTAATGTATGGAAAAGAATACCATGGTATTACTGTGGTACATGTACCATTACAATACAGTGGTTTTACGAACACTTAACATTGTAGTACTATTGTACCGAATGACTTACATGTTTATATACTCAAGACTAAATGTGGTGTTTTTTTACTAATGATCTCTTGTGTCCATCCATCGGTCAGACGGCAGAATGACATCCATCATCAAGCTGACGGCTCTATCCGGAGTCCAAGAGGAATCGGCTCTGTGTTACCTGTTGCAGGTGGACGAGTTCCGTTTCCTACTGGACTGTGGCTGGGATGAAAGTTTCTCCATGGACATCATAGATTCCCTGAAGCGGTATGTCCATCAGGTGGATGCAGTGCTGCTCTCTCACCCTGATCATCTGCATCTGGGCGCTTTACCGTACGCTGTGGGCAAACTGGGGCTCAACTGCACCATCTACGCCACCATTCCCGTCTACAAGATGGGCCAGATGTTCATGTACGATCTCTATCAGTCTCGACACAACACAGAAGATTTCACTCTGTTCACACTGGATGATGTTGATTCAGCTTTTGACAAAATCCAGCAGCTGAAATACTCTCAGATCGTCAACCTAAAAGGAAAAGGTCACGGTCTGTCCATCACCCCACTTCCTGCGGGTCACATGATTGGTGGCACTATCTGGAAAATAGTGAAGGATGGTGAGGAAGAGATCATCTATGCTGTGGATTTCAACCACAAGAGGGAGATTCATCTTAACGGCTGCTCCCTGGAGACCGTCAGCAGACCGTCTCTCCTCATCACAGACTCCTTCAACGCCTCGTATGTACAGCCTCGCAGGAAACAACGTGACGAGCAACTGCTCACCAATGTCATGGAGACTCTGCGTGGTGACGGGAACGTGCTGATTGCCGTGGATACGGCAGGAAGAGTGCTGGAGCTGGCTCAGCTCCTGGATCAGATCTGGAGGACCAAAGACGCAGGATTGGGCGTCTACTCGCTCGCCCTGCTCAACAACGTCAGCTACAACGTGGTGGAGTTCTCAAAGTCCCAGGTCGAGTGGATGAGCGATAAACTGATGCGATGCTTCGAGGACAAGCGGAACAACCCTTTCCAGTTCCGCCACCTCTCGCTCTGTCACAGCCTGGCCGATCTGTCCCGCGTGCCCAGCCCGAAGGTGGTTCTGTGCAGTCAGCCCGATCTGGAGTCGGGTTTCTCACGAGAGCTCTTCATACAGTGGTGTCAGGACGCCAAGAACTCAGTCATCCTCACCTACAGGACCACACCAGGAACACTGGCCCGCTACCTCATAGATAACCCGGGGGAGAAGAGGATGGACCTGGAGATAAGAAAACGCTGCCGACTGGAAGGACGAGAACTAGAGGAATACTTGGAGAAAGAGAAAATGAAGAAAGAAGCTGCCAAAAAACTGGAGCAGGCGAAAGAGGTGGACTTAGACTCGAGCGATGAGAGCGATATGGAGGACGATCTGGAGCAGCCGGTGGTGGTGAAGACCAAACATCACGACCTGATGATGAAGGGCGAGGGCGGGAGGAAGGGCAGCTTCTTCAAACAGGCCAAGAAGTCCTACCCCATGTTCCCTACACACGAGGAGCGAATCAAATGGGATGAGTACGGAGAAATCATCAGGCCAGAGGAGTTTCTGGTTCCTGAACTCCAGGCCACTGAAGAGGAGAAGAGCAAACTGGAGTCTGGACTAACCAACGGAGACGAGCCCATGGAGCAAGACCTGTCCGACGTCCCCACCAAATGCACCAGCACTACACAAACACTAGAGATCAGAGCTCGAGTGTCGTACATCGATTACGAGGGCCGTTCTGACGGGGACTCCATTAAGAAGATCATAAACCAGATGAAACCCAGACAGCTAGTCATCGTTCACGGCCCGCCTGAGGCCAGTCAGGACCTGGCTGAGTCCTGCAAGGCCTTCAGCGGGAAGGACATCAAAGTTTACATGCCAAAACTGCAGGAGACGGTGGATGCCACCAGCGAGACCCACATTTACCAGGTCAGGCTGAAGGACTCTCTGGTCAGCTCGCTGCAGTTCTGTAAAGCCAAAGACACAGAGCTGGCCTGGATCGATGGCGTTCTGGACATGCGGGTGGAAAAAGTGGACACGGGTGTCATCGCGGAGATGGGAGAAGCGAAAGAGGACACGGAGGATGGAGAACCGGCCATGGATGTCACACCGGACCTGTCGACTGAGCCCAGTGTCACAGCTAACCAGCGGGCCATGAAGACTCTGTTTGGAGAGGATGACAGGGAGATCTCAGAGGAGAGTGACGTCATACCCACGCTGGAGCCGCTGCCTGCTCAGGAGGTCCCAGGTCATCAGTCTGTGTTCATCAACGAACCTCGACTGTCGGACTTCAAGCAGGTTCTGCTGCGTGAAGGGATCCAGGCGGAGTTTGTGGGCGGAGTTCTGGTGTGTAATAATCTGGTCGCTGTCAGGAGGACGGAAGCGGGCCGCATCTGTCTGGAAGGCTGCCATTGTGATGACTACTACAGGATTCGTGAGCTGTTGTATCAGCAGTACGCTGTAGTTTAGCTCTTCAGTGCTGAACAGGTCTCATTCAGCGCCAGCTGTACCCTTAATTTCCTTTTGTAGGTTTTGAGTTGTGTATTTTTTATTATTGTTTCCCTGCACGTTTAAATGGCAGACGTTCTCTTTCGGAGTGACCCCAAATGAGAGCATCTGTCTGGACTGTAGAAGAATCGTTGCTACTGTGCTTTTGTTCAGAACTGCTGGATGCAAAGAAAGGCTTTTTTACCCTTTATTTTGTCCACAG

Function


symbol description
cpsf2 Predicted to enable RNA binding activity. Predicted to be involved in mRNA 3'-end processing by stem-loop binding activity and cleavage; mRNA polyadenylation; and pre-mRNA cleavage required for polyadenylation. Predicted to act upstream of or within mRNA cleavage and mRNA processing. Predicted to be located in nucleus. Predicted to be part of mRNA cleavage and polyadenylation specificity factor complex. Orthologous to human CPSF2 (cleavage and polyadenylation specific factor 2).

GO:

id name namespace
GO:0005654 nucleoplasm cellular_component

KEGG:

id description
K14402 CPSF2, CFT2; cleavage and polyadenylation specificity factor subunit 2

RNA


RNA id representative length rna type GC content exon number start site end site
CI01000029_06834812_06849277.mRNA True 3840 mRNA 0.46 14 6833577 6849518

Neighbor


gene id symbol gene type direction distance location
CI01000029_06824397_06831521 CUNH14ORF169 coding upstream 906 6823629 ~ 6832671 (+)
CI01000029_06775631_06791278 INTS9 coding upstream 42231 6775631 ~ 6791346 (+)
CI01000029_06724205_06734079 HMBOX1B, HMBOX1 coding upstream 97995 6724205 ~ 6735582 (+)
CI01000029_06494856_06495812 NA coding upstream 337730 6494391 ~ 6495847 (+)
CI01000029_06453008_06476863 NA coding upstream 356487 6453008 ~ 6477090 (+)
CI01000029_06865685_06903896 SLC24A4, SLC24A4B coding downstream 16167 6865685 ~ 6903896 (+)
CI01000029_06906861_06907157 NA coding downstream 56316 6905834 ~ 6908633 (+)
CI01000029_06923683_06924032 NA coding downstream 73710 6923228 ~ 6924085 (+)
CI01000029_07132444_07166752 FUT8B, FUT8 coding downstream 282926 7132444 ~ 7166930 (+)
CI01000029_07206718_07211765 NA coding downstream 356169 7205687 ~ 7212408 (+)
G155253 NA non-coding upstream 81799 6751287 ~ 6751778 (+)
G155249 NA non-coding upstream 86015 6744832 ~ 6747562 (+)
G155235 NA non-coding upstream 111020 6722344 ~ 6722557 (+)
G155248 NA non-coding upstream 111393 6721963 ~ 6722184 (+)
G155236 NA non-coding upstream 143640 6689281 ~ 6689937 (+)
G155274 NA non-coding downstream 2958 6852476 ~ 6852878 (+)
G155357 NA non-coding downstream 151921 7001439 ~ 7001888 (+)
G155367 NA non-coding downstream 161578 7011096 ~ 7011301 (+)
G155389 NA non-coding downstream 216187 7065705 ~ 7066055 (+)
G155083 NA other upstream 343283 6431535 ~ 6490294 (+)
G155079 NA other upstream 545179 6282685 ~ 6288398 (+)
CI01000029_05461497_05468985 BATF other upstream 1363231 5461497 ~ 5470346 (+)
G154587 NA other upstream 1562042 5271020 ~ 5271535 (+)
G153790 NA other upstream 2093470 4733589 ~ 4740107 (+)
G155522 NA other downstream 785315 7634833 ~ 7637834 (+)
G155627 NA other downstream 1081962 7931480 ~ 7932177 (+)
G156007 NA other downstream 2115300 8964818 ~ 9064541 (+)

Expression



Co-expression Network


Homologous


species gene id symbol gene type chromosome NCBI id location