thoc5 (thoc5)



Basic Information


Item                 Value
gene id              thoc5
gene name            thoc5
gene type            coding
species              striped catfish (Pangasianodon hypophthalmus)
category of species  economic fish

Chromosome Information


Item               Value
chromosome id      NC_047603.1
NCBI id            CM018549.1
chromosome length  29458963 bp
location           19269694 ~ 19280582 (+)
genome version     GENO_Phyp_1.0_2019_striped_catfish_Genome
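
The location value packs start, end, and strand into one string. A minimal sketch of how it could be split apart and turned into a gene span follows. The helper names `parse_location` and `span_length` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this record does not state explicitly.

```python
import re


def parse_location(text):
    """Parse a 'start ~ end (strand)' string into (start, end, strand)."""
    match = re.fullmatch(r"\s*(\d+)\s*~\s*(\d+)\s*\(([+-])\)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return int(match.group(1)), int(match.group(2)), match.group(3)


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


start, end, strand = parse_location("19269694 ~ 19280582 (+)")
print(start, end, strand, span_length(start, end))
```

Under that assumption the gene spans 10889 positions on the plus strand. If the coordinates were 0-based half-open instead, the span would be one less.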

Sequence


>XM_034307053.1
acctttattttgaaaaagGACCCAAAGAAATCCGGAAGTGCAACTGTAGTTGACTTCAACAAATGATCCCGGGCTCGGTTATTGGTTAAAAGTATAGCTTGAGCgaaagaaattaaacaaagCCATGTCTTCTGACCCTCTCAAAAAGCGCAAGACCAAAGTGATCCGGAGTGATGGAGGCACACCTGAGCCTAAAAGGGTTCGAGGAGAAGGAGACCAGGATGTGCGTACATATAACGAAGAAGTGGAACTGGAGAGTCGAGACCCTGAACAGGATTACACCCTGTATAAAAACACCTGTGAGGCACTGGCTACACTAATGGGTGAAATCCAAGAACTGAAAGCCAGTGGAGCTAAAGAAGGGAGTGTAGAGGTTGAAGCAAAACGCAGGCAAGGATGTGTTCACTTcatcactctgaagaagctGAATCGTCTGTCACACATGAGGCTGAAGAAAGCACGGGATCAGACACACGAGGCAAAGCAGAAAGTGGACGTGTTGCACCTACAGCTTCAGAATCTTCTCTATGAAGTCATGCATCTTCAGAAAGAAATCGGGAAATGCCTCGAGTTCAAgtCCCAGCATGAAGAGATTGAGTTAGTCAGTGTGGATGAGTTCTTTAAGGAGGCTCCAGCTGAAATCTCCCGCCCTCATCTCACTAGAGACGACCCTCACCAGCTCACGCTCGCTCGGCTGGACTGGGAACTGGAGCAGAGGAAGAGGTTGGCTGAGCAGTACAAGAGCTCGTTGTGCACTAAGGAAAAGATCTTGAAAGGAATTGAGTTGAAGAAGGAATACCTGAGCAGTCTTCAGCCAGGCCTGCAGGCCATCATGCAGgcGTCTCTGCCCGTGCAGGAGTACTTGTCCATGCCATTTGAGCGTGTGCAGAAACAGGCTGAGGTTGCACGGcaccttcctcctcctctgtaTGTGCTGCTTGTGCAGGCCAATGCTTATGGACAAGCCTGTGacaAGAATCTGTCTGTGTCTATTAGCGGAGATGTGGATGAAGCCAAGGCTCTTTCCAGACCACCCGAGGATTCTCAAGaTGATGAGAGTGATTCTGATGCAGAGGAAGAACAACAGAACACTAAACGACGGCGAGGTACTGTTGGCGTTCAGCTTGATGACAAGCGTAAAGAAATGCTGAAACGGCATCCTCTCTCACTGTGCATCGACCTCaagtgtaaagATGGCAGTGTGttgcatctgtttttttattacctGATGAACCTCAACATTCTCACTGTGAAAGCAAAAGTTTCGGCCTCCGCAGACCTCTCAGGAGCCATCAGTGCAGGGGAGCTGCTGAACCCAGAATCTCTGTTGAACTGCCTGTACGCCAATGATGATGGCCAAGAAACCCCCAACCCTACCAACCGCTACCAGTTTGACAAAGTGGGCATCACAACTTTTGCTGATTATGTGACTGACCTTGGACATCCATACATCTGGGTGCAGAATCTTGGAGGATTACAATTTTCCAGTGACACACCGCAGGTGGAAGTGGTACGCAGTTCTCTCAGTGCCAGTCATATGGAGAGCACCGTGAAGCTGCTGAGAGGACGCCTGCTGTCTCGCCTGGCTTTGCACAAACAGTTCAGCTCTCTAGAGCACAGCATAGTTCCTGTCTCCACCGAATGCCAGCATCTCTTCCCAGCTAAAGTGGTCTCTCGCCTGGCTCGCTGGACTATGATGAGCTGCCAGGACTTCCTGgagttAAGCTTTGTGCAGCATGTGGTGAAGGCTGGTCTGGCGCGGGAGACTGACCTGTTCTTCATGGCAGTAGTGGAGAGGGGAACAGCTCGTCTTCAGGCTGCGGTGGTGTTGAATCCTCGCTATCCAGAGATCACCCCTCTCTTCTCACTTTCACTGCTCTGGAAAGGAGAGCGCAGTGGCCGCACAGATGATAACCTGCGGGCGTTGGAGAGCGAGGTGAATGTGTTTAGGTCTGAACTCCAGGGACCCCGACCAGGCCTACagctactgaccaatcaga
tccaGCGTCTATGCATGTGTCTGGATGTTTATCTGGAGACCGAAAGCCAAGTGTGTGACGGCTCAGAGGGCCCAAGAGAATTCCCAAGGGAGAAGATGTGCTTACGCTCTGCCAGGGGACCTAATCGTCTGAAGCCTTTTAAGTACAACCACCCTCAGGGCTTTTTCAGTCATCGTTAaatctctgtttttcctcttctctaCATTTTACCTTCGTTTTTACAGCTTGATTAAATTTTCAAGTGTTTGATTCTATATTGTAATGTTTCACTGTGta
>XM_034307054.1
aacaaaggTGAGGACTAATGATCCATGTCTTCTGACCCTCTCAAAAAGCGCAAGACCAAAGTGATCCGGAGTGATGGAGGCACACCTGAGCCTAAAAGGGTTCGAGGAGAAGGAGACCAGGATGTGCGTACATATAACGAAGAAGTGGAACTGGAGAGTCGAGACCCTGAACAGGATTACACCCTGTATAAAAACACCTGTGAGGCACTGGCTACACTAATGGGTGAAATCCAAGAACTGAAAGCCAGTGGAGCTAAAGAAGGGAGTGTAGAGGTTGAAGCAAAACGCAGGCAAGGATGTGTTCACTTcatcactctgaagaagctGAATCGTCTGTCACACATGAGGCTGAAGAAAGCACGGGATCAGACACACGAGGCAAAGCAGAAAGTGGACGTGTTGCACCTACAGCTTCAGAATCTTCTCTATGAAGTCATGCATCTTCAGAAAGAAATCGGGAAATGCCTCGAGTTCAAgtCCCAGCATGAAGAGATTGAGTTAGTCAGTGTGGATGAGTTCTTTAAGGAGGCTCCAGCTGAAATCTCCCGCCCTCATCTCACTAGAGACGACCCTCACCAGCTCACGCTCGCTCGGCTGGACTGGGAACTGGAGCAGAGGAAGAGGTTGGCTGAGCAGTACAAGAGCTCGTTGTGCACTAAGGAAAAGATCTTGAAAGGAATTGAGTTGAAGAAGGAATACCTGAGCAGTCTTCAGCCAGGCCTGCAGGCCATCATGCAGgcGTCTCTGCCCGTGCAGGAGTACTTGTCCATGCCATTTGAGCGTGTGCAGAAACAGGCTGAGGTTGCACGGcaccttcctcctcctctgtaTGTGCTGCTTGTGCAGGCCAATGCTTATGGACAAGCCTGTGacaAGAATCTGTCTGTGTCTATTAGCGGAGATGTGGATGAAGCCAAGGCTCTTTCCAGACCACCCGAGGATTCTCAAGaTGATGAGAGTGATTCTGATGCAGAGGAAGAACAACAGAACACTAAACGACGGCGAGGTACTGTTGGCGTTCAGCTTGATGACAAGCGTAAAGAAATGCTGAAACGGCATCCTCTCTCACTGTGCATCGACCTCaagtgtaaagATGGCAGTGTGttgcatctgtttttttattacctGATGAACCTCAACATTCTCACTGTGAAAGCAAAAGTTTCGGCCTCCGCAGACCTCTCAGGAGCCATCAGTGCAGGGGAGCTGCTGAACCCAGAATCTCTGTTGAACTGCCTGTACGCCAATGATGATGGCCAAGAAACCCCCAACCCTACCAACCGCTACCAGTTTGACAAAGTGGGCATCACAACTTTTGCTGATTATGTGACTGACCTTGGACATCCATACATCTGGGTGCAGAATCTTGGAGGATTACAATTTTCCAGTGACACACCGCAGGTGGAAGTGGTACGCAGTTCTCTCAGTGCCAGTCATATGGAGAGCACCGTGAAGCTGCTGAGAGGACGCCTGCTGTCTCGCCTGGCTTTGCACAAACAGTTCAGCTCTCTAGAGCACAGCATAGTTCCTGTCTCCACCGAATGCCAGCATCTCTTCCCAGCTAAAGTGGTCTCTCGCCTGGCTCGCTGGACTATGATGAGCTGCCAGGACTTCCTGgagttAAGCTTTGTGCAGCATGTGGTGAAGGCTGGTCTGGCGCGGGAGACTGACCTGTTCTTCATGGCAGTAGTGGAGAGGGGAACAGCTCGTCTTCAGGCTGCGGTGGTGTTGAATCCTCGCTATCCAGAGATCACCCCTCTCTTCTCACTTTCACTGCTCTGGAAAGGAGAGCGCAGTGGCCGCACAGATGATAACCTGCGGGCGTTGGAGAGCGAGGTGAATGTGTTTAGGTCTGAACTCCAGGGACCCCGACCAGGCCTACagctactgaccaatcagatccaGCGTCTATGCATGTGTCTGGATGTTTATCTGGAGACCGAAAGCCAAGTGTGTGACGGCTCAGAGGGCCCAAGAGAATTCCCAAGGGAGAAGATGTG
CTTACGCTCTGCCAGGGGACCTAATCGTCTGAAGCCTTTTAAGTACAACCACCCTCAGGGCTTTTTCAGTCATCGTTAaatctctgtttttcctcttctctaCATTTTACCTTCGTTTTTACAGCTTGATTAAATTTTCAAGTGTTTGATTCTATATTGTAATGTTTCACTGTGta
>XM_034307055.1
TGATCAAATCAATGCTTGTGTGAATGGAAGATCTTGCGTGAGGtttgaaaataaacaagtgGACCACCTCGTTGCTAGCAGGCTAAGCTAGATACTGTGGATTAGCGCCATGTCTTCTGACCCTCTCAAAAAGCGCAAGACCAAAGTGATCCGGAGTGATGGAGGCACACCTGAGCCTAAAAGGGTTCGAGGAGAAGGAGACCAGGATGTGCGTACATATAACGAAGAAGTGGAACTGGAGAGTCGAGACCCTGAACAGGATTACACCCTGTATAAAAACACCTGTGAGGCACTGGCTACACTAATGGGTGAAATCCAAGAACTGAAAGCCAGTGGAGCTAAAGAAGGGAGTGTAGAGGTTGAAGCAAAACGCAGGCAAGGATGTGTTCACTTcatcactctgaagaagctGAATCGTCTGTCACACATGAGGCTGAAGAAAGCACGGGATCAGACACACGAGGCAAAGCAGAAAGTGGACGTGTTGCACCTACAGCTTCAGAATCTTCTCTATGAAGTCATGCATCTTCAGAAAGAAATCGGGAAATGCCTCGAGTTCAAgtCCCAGCATGAAGAGATTGAGTTAGTCAGTGTGGATGAGTTCTTTAAGGAGGCTCCAGCTGAAATCTCCCGCCCTCATCTCACTAGAGACGACCCTCACCAGCTCACGCTCGCTCGGCTGGACTGGGAACTGGAGCAGAGGAAGAGGTTGGCTGAGCAGTACAAGAGCTCGTTGTGCACTAAGGAAAAGATCTTGAAAGGAATTGAGTTGAAGAAGGAATACCTGAGCAGTCTTCAGCCAGGCCTGCAGGCCATCATGCAGgcGTCTCTGCCCGTGCAGGAGTACTTGTCCATGCCATTTGAGCGTGTGCAGAAACAGGCTGAGGTTGCACGGcaccttcctcctcctctgtaTGTGCTGCTTGTGCAGGCCAATGCTTATGGACAAGCCTGTGacaAGAATCTGTCTGTGTCTATTAGCGGAGATGTGGATGAAGCCAAGGCTCTTTCCAGACCACCCGAGGATTCTCAAGaTGATGAGAGTGATTCTGATGCAGAGGAAGAACAACAGAACACTAAACGACGGCGAGGTACTGTTGGCGTTCAGCTTGATGACAAGCGTAAAGAAATGCTGAAACGGCATCCTCTCTCACTGTGCATCGACCTCaagtgtaaagATGGCAGTGTGttgcatctgtttttttattacctGATGAACCTCAACATTCTCACTGTGAAAGCAAAAGTTTCGGCCTCCGCAGACCTCTCAGGAGCCATCAGTGCAGGGGAGCTGCTGAACCCAGAATCTCTGTTGAACTGCCTGTACGCCAATGATGATGGCCAAGAAACCCCCAACCCTACCAACCGCTACCAGTTTGACAAAGTGGGCATCACAACTTTTGCTGATTATGTGACTGACCTTGGACATCCATACATCTGGGTGCAGAATCTTGGAGGATTACAATTTTCCAGTGACACACCGCAGGTGGAAGTGGTACGCAGTTCTCTCAGTGCCAGTCATATGGAGAGCACCGTGAAGCTGCTGAGAGGACGCCTGCTGTCTCGCCTGGCTTTGCACAAACAGTTCAGCTCTCTAGAGCACAGCATAGTTCCTGTCTCCACCGAATGCCAGCATCTCTTCCCAGCTAAAGTGGTCTCTCGCCTGGCTCGCTGGACTATGATGAGCTGCCAGGACTTCCTGgagttAAGCTTTGTGCAGCATGTGGTGAAGGCTGGTCTGGCGCGGGAGACTGACCTGTTCTTCATGGCAGTAGTGGAGAGGGGAACAGCTCGTCTTCAGGCTGCGGTGGTGTTGAATCCTCGCTATCCAGAGATCACCCCTCTCTTCTCACTTTCACTGCTCTGGAAAGGAGAGCGCAGTGGCCGCACAGATGATAACCTGCGGGCGTTGGAGAGCGAGGTGAATGTGTTTAGGTCTGAACTCCAGGGACCCCGACCAGGCCTACagctactgaccaatcagatccaGCGTCTATGCA
TGTGTCTGGATGTTTATCTGGAGACCGAAAGCCAAGTGTGTGACGGCTCAGAGGGCCCAAGAGAATTCCCAAGGGAGAAGATGTGCTTACGCTCTGCCAGGGGACCTAATCGTCTGAAGCCTTTTAAGTACAACCACCCTCAGGGCTTTTTCAGTCATCGTTAaatctctgtttttcctcttctctaCATTTTACCTTCGTTTTTACAGCTTGATTAAATTTTCAAGTGTTTGATTCTATATTGTAATGTTTCACTGTGta
>XM_034307056.1
AATTGATCAAATCAATGCTTGTGTGAATGGAAGATCTTGCGTGAGCCATGTCTTCTGACCCTCTCAAAAAGCGCAAGACCAAAGTGATCCGGAGTGATGGAGGCACACCTGAGCCTAAAAGGGTTCGAGGAGAAGGAGACCAGGATGTGCGTACATATAACGAAGAAGTGGAACTGGAGAGTCGAGACCCTGAACAGGATTACACCCTGTATAAAAACACCTGTGAGGCACTGGCTACACTAATGGGTGAAATCCAAGAACTGAAAGCCAGTGGAGCTAAAGAAGGGAGTGTAGAGGTTGAAGCAAAACGCAGGCAAGGATGTGTTCACTTcatcactctgaagaagctGAATCGTCTGTCACACATGAGGCTGAAGAAAGCACGGGATCAGACACACGAGGCAAAGCAGAAAGTGGACGTGTTGCACCTACAGCTTCAGAATCTTCTCTATGAAGTCATGCATCTTCAGAAAGAAATCGGGAAATGCCTCGAGTTCAAgtCCCAGCATGAAGAGATTGAGTTAGTCAGTGTGGATGAGTTCTTTAAGGAGGCTCCAGCTGAAATCTCCCGCCCTCATCTCACTAGAGACGACCCTCACCAGCTCACGCTCGCTCGGCTGGACTGGGAACTGGAGCAGAGGAAGAGGTTGGCTGAGCAGTACAAGAGCTCGTTGTGCACTAAGGAAAAGATCTTGAAAGGAATTGAGTTGAAGAAGGAATACCTGAGCAGTCTTCAGCCAGGCCTGCAGGCCATCATGCAGgcGTCTCTGCCCGTGCAGGAGTACTTGTCCATGCCATTTGAGCGTGTGCAGAAACAGGCTGAGGTTGCACGGcaccttcctcctcctctgtaTGTGCTGCTTGTGCAGGCCAATGCTTATGGACAAGCCTGTGacaAGAATCTGTCTGTGTCTATTAGCGGAGATGTGGATGAAGCCAAGGCTCTTTCCAGACCACCCGAGGATTCTCAAGaTGATGAGAGTGATTCTGATGCAGAGGAAGAACAACAGAACACTAAACGACGGCGAGGTACTGTTGGCGTTCAGCTTGATGACAAGCGTAAAGAAATGCTGAAACGGCATCCTCTCTCACTGTGCATCGACCTCaagtgtaaagATGGCAGTGTGttgcatctgtttttttattacctGATGAACCTCAACATTCTCACTGTGAAAGCAAAAGTTTCGGCCTCCGCAGACCTCTCAGGAGCCATCAGTGCAGGGGAGCTGCTGAACCCAGAATCTCTGTTGAACTGCCTGTACGCCAATGATGATGGCCAAGAAACCCCCAACCCTACCAACCGCTACCAGTTTGACAAAGTGGGCATCACAACTTTTGCTGATTATGTGACTGACCTTGGACATCCATACATCTGGGTGCAGAATCTTGGAGGATTACAATTTTCCAGTGACACACCGCAGGTGGAAGTGGTACGCAGTTCTCTCAGTGCCAGTCATATGGAGAGCACCGTGAAGCTGCTGAGAGGACGCCTGCTGTCTCGCCTGGCTTTGCACAAACAGTTCAGCTCTCTAGAGCACAGCATAGTTCCTGTCTCCACCGAATGCCAGCATCTCTTCCCAGCTAAAGTGGTCTCTCGCCTGGCTCGCTGGACTATGATGAGCTGCCAGGACTTCCTGgagttAAGCTTTGTGCAGCATGTGGTGAAGGCTGGTCTGGCGCGGGAGACTGACCTGTTCTTCATGGCAGTAGTGGAGAGGGGAACAGCTCGTCTTCAGGCTGCGGTGGTGTTGAATCCTCGCTATCCAGAGATCACCCCTCTCTTCTCACTTTCACTGCTCTGGAAAGGAGAGCGCAGTGGCCGCACAGATGATAACCTGCGGGCGTTGGAGAGCGAGGTGAATGTGTTTAGGTCTGAACTCCAGGGACCCCGACCAGGCCTACagctactgaccaatcagatccaGCGTCTATGCATGTGTCTGGATGTTTATCTGGAGACCGAAAGCCAAGTGTGTGACGGCTCAGAGGGCCCAAGA
GAATTCCCAAGGGAGAAGATGTGCTTACGCTCTGCCAGGGGACCTAATCGTCTGAAGCCTTTTAAGTACAACCACCCTCAGGGCTTTTTCAGTCATCGTTAaatctctgtttttcctcttctctaCATTTTACCTTCGTTTTTACAGCTTGATTAAATTTTCAAGTGTTTGATTCTATATTGTAATGTTTCACTGTGta

Function


symbol description
thoc5 Predicted to enable mRNA binding activity. Predicted to be involved in mRNA export from nucleus and positive regulation of DNA-templated transcription, elongation. Predicted to act upstream of or within RNA processing; cell differentiation; and mRNA transport. Predicted to be located in cytoplasm and nuclear speck. Predicted to be part of THO complex part of transcription export complex. Human ortholog(s) of this gene implicated in breast carcinoma. Orthologous to human THOC5 (THO complex 5).

NR:

description
PREDICTED: THO complex subunit 5 homolog

GO:

id          name                  namespace
GO:0030154  cell differentiation  biological_process
GO:0006397  mRNA processing       biological_process
GO:0051028  mRNA transport        biological_process
GO:0008380  RNA splicing          biological_process
GO:0016607  nuclear speck         cellular_component
GO:0005737  cytoplasm             cellular_component
GO:0003723  RNA binding           molecular_function

KEGG:

id      description
K13174  THOC5; THO complex subunit 5

RNA


RNA id          representative  length  rna type  GC content  exon number  start site  end site
XM_034307053.1  True            2276    mRNA      0.49        20           19269694    19280582
XM_034307054.1  False           2176    mRNA      0.50        20           19269809    19280582
XM_034307055.1  False           2261    mRNA      0.50        20           19269859    19280582
XM_034307056.1  False           2199    mRNA      0.50        20           19269856    19280582
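
The length and GC content columns can in principle be recomputed from the FASTA records in the Sequence section. The sketch below shows one way to do that. `read_fasta` and `gc_content` are hypothetical helper names, and treating lowercase (soft-masked) bases the same as uppercase is an assumption about how the listed values were derived. The input is a toy example, not data from this record.

```python
def read_fasta(lines):
    """Yield (record_id, sequence) pairs, joining sequences wrapped over lines."""
    record_id, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if record_id is not None:
                yield record_id, "".join(chunks)
            record_id, chunks = line[1:].strip(), []
        elif record_id is not None:
            chunks.append(line)
    if record_id is not None:
        yield record_id, "".join(chunks)


def gc_content(sequence):
    """Fraction of G and C bases, counted case-insensitively (assumption)."""
    seq = sequence.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy records for illustration only.
toy = [">demo_1", "acGT", "GGcc", ">demo_2", "ATAT"]
for rid, seq in read_fasta(toy):
    print(rid, len(seq), round(gc_content(seq), 2))
```

Joining wrapped lines before measuring matters here because several sequences in this record continue onto a second line.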

Neighbor


gene id symbol gene type direction distance location
LOC113530162 LOC108274540,LOC103046250,LOC108428357 coding upstream 29056 19237672 ~ 19240638 (+)
prnprs3 NA coding upstream 43278 19221929 ~ 19226416 (+)
incenp incenp,LOC107701072 coding upstream 53347 19204494 ~ 19216347 (+)
ccdc77 ccdc77,LOC107754361 coding upstream 94590 19167269 ~ 19175104 (+)
LOC113529841 LOC108274949,LOC108428426 coding upstream 107546 19154291 ~ 19162148 (+)
chid1 chid1,LOC107674841 coding downstream 5203 19285785 ~ 19292110 (+)
mov10l1 mov10l1 coding downstream 150844 19431426 ~ 19446347 (+)
szl LOC108274651,LOC108428364 coding downstream 168961 19449543 ~ 19451833 (+)
dbx2 dbx2 coding downstream 176167 19456749 ~ 19464228 (+)
nell2a nell2,LOC108428412,LOC107661281,LOC100380741 coding downstream 194936 19475518 ~ 19542669 (+)
G151395 NA non-coding upstream 41801 19226649 ~ 19227893 (+)
G151134 NA non-coding upstream 158721 19109843 ~ 19110973 (+)
G151084 NA non-coding upstream 271762 18955049 ~ 18997932 (+)
G151093 NA non-coding upstream 323752 18945275 ~ 18945942 (+)
LOC113530068 NA non-coding upstream 648873 18601539 ~ 18620821 (+)
G151417 NA non-coding downstream 37404 19317986 ~ 19393161 (+)
G151419 NA non-coding downstream 113292 19393874 ~ 19397489 (+)
G151421 NA non-coding downstream 117696 19398278 ~ 19400388 (+)
G151445 LOC108274686,LOC108415052 non-coding downstream 122727 19403309 ~ 19403657 (+)
G151447 NA non-coding downstream 123390 19403972 ~ 19404398 (+)
G151401 NA other upstream 5801 19260624 ~ 19263893 (+)
faap24 faap24 other upstream 144493 19121151 ~ 19125201 (+)
nfat5a LOC108274760,LOC108428435 other upstream 473435 18747827 ~ 18796259 (+)
urahb LOC108274538 other upstream 545008 18721595 ~ 18724686 (+)
rfwd3 rfwd3,LOC107754982 other upstream 586266 18675010 ~ 18683428 (+)
srpk2 srpk2,LOC106907745,LOC108428417,LOC103366878,LOC101063511,LOC103134474 other downstream 42749 19323331 ~ 19384904 (+)
LOC113529760 adamts20,LOC107733613,LOC106907901 other downstream 381261 19661843 ~ 19748080 (+)
LOC113529593 LOC108275426,LOC108432651,LOC103374925,LOC107389115,LOC106531033 other downstream 2772032 22052614 ~ 22058158 (+)
arpp19b arpp19,LOC108275368,LOC107740885,LOC107569706,LOC103909813,LOC107673248,LOC107670207,LOC107559970 other downstream 2796267 22076849 ~ 22078496 (+)
G153014 zmp:0000001167,LOC108274702,LOC107754369,LOC107551438,LOC107561184,LOC107655150 other downstream 2935693 22216275 ~ 22266382 (+)
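
The distance values listed above are consistent with a simple gap calculation against the thoc5 location (19269694 ~ 19280582): for an upstream neighbor, the thoc5 start minus the neighbor's end; for a downstream neighbor, the neighbor's start minus the thoc5 end. A minimal sketch, assuming that is how the values were derived. `neighbor_distance` is a hypothetical name, and the "overlapping" branch is an added assumption, since no overlapping neighbor appears in the table.

```python
def neighbor_distance(gene_start, gene_end, nb_start, nb_end):
    """Return (direction, distance) of a neighbor relative to a plus-strand gene."""
    if nb_end < gene_start:
        # Neighbor lies entirely before the gene.
        return "upstream", gene_start - nb_end
    if nb_start > gene_end:
        # Neighbor lies entirely after the gene.
        return "downstream", nb_start - gene_end
    # Not represented in the table; included only so the function is total.
    return "overlapping", 0


THOC5 = (19269694, 19280582)
print(neighbor_distance(*THOC5, 19237672, 19240638))  # LOC113530162
print(neighbor_distance(*THOC5, 19285785, 19292110))  # chid1
```

These two calls reproduce the listed distances of 29056 (upstream) and 5203 (downstream). For a gene on the minus strand the upstream and downstream labels would presumably be swapped, which this sketch does not handle.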
