sox1a (LOC103372296,LOC101481640,LOC100691128)



Basic Information


Item Value
gene id sox1a
gene name LOC103372296,LOC101481640,LOC100691128
gene type misc
species eurasian perch (Perca fluviatilis)
category of species economic fish

Chromosome Information


Item Value
chromosome id NC_053123.1
NCBI id CM020920.1
chromosome length 39440604
location 21619348 ~ 21634757 (+)
genome version GENO_Pfluv_1.0_2020_eurasian_perch_Genome

Sequence


>XM_039819002.1
TGCAAATCACTCTGCTGCTAGAGCCTCATTGGCCGCTTGTGGCTCATTTAACCGCTGTCAGTGCGCATCATATGGCCGCCGCTTTAACCAACTTCTCAACTCCGACAGGGAAAGTAGCTCAATGCGCATTCTATCACAGTGGATGTGAACCAACCCATGCGAAGACTGGGTGAAATACTGTTTGAGGAACATTTTATATCCTTTTTCGGATTATTTTTTATCTCGTATTTGAATCATTTGGGTTTAGAATTTTAATTTTGCAACCAAGTCATCCTGTGCACACCCGATCTTGTGCGCGCCTATCTTGAAATATACTTTTTCTCTGTTGGTGGCGAAGTTTAAAATCAGGTGAATGTATAGCATGATGATGGAAACTGACCTCCATTCCCCTGGCCCCCAAACGAACACTACCACGGGGCAAACGGGGCCGAACAGCGGGTCCAAAGCGAATCAGGAACGGGTGAAGAGACCGATGAACGCGTTCATGGTGTGGTCCCGTGGGCAGCGGAGAAAGATGGCACAGGAGAACCCCAAAATGCACAACTCAGAGATCAGCAAACGGCTGGGCGCCGAGTGGAAGGTGATGTCCGAGGCTGAGAAGCGCCCTTTCATCGACGAGGCGAAACGGTTGCGAGCTATGCACATGAAGGAGCATCCGGATTACAAGTACCGGCCAAGACGGAAGACCAAGACGCTGCTGAAAAAGGACAAATATTCCCTTGCCGGTGGACTGCTTTCTGGCCCCAGTGGGGGCGGTGGAGTTGGTTTAGGGGTCGGAATGAGTTCATCCGGGGTCGGGCAGAGGTTAGAGAGCCCCGTGGGTCACGTAGGCTCCGCCAGCTCCGGCTATGCGCACATGAACGGCTGGGCGAACGGTGCGTACTCGGGGCAGGTGGCTGCAGCCGCGGCGGCGGCAGCGATGATGCAGGAGGCTCAGCTAGCCTACAGCCAGCACCCGGGGAGCGGAgcgcaccaccaccaccactcacACCACCATCACTCACACAACCCCCAGCCCATGCACCGCTACGACATGACAGCCCTGCAGTACAGCCCCATCTCGAACTCTCAGGGCTACATGAACACTTCTCCATCCGGCTACGGTGGGATCACCTATACACAGCACCAGGGCTCCGGCGTCTCCTCCTCAGCTGCCATGGGGACGTTAGGGTCGCTCGTGAAGTCGGAGCCGAGTatttCTCCGTTGTTGGAATAGCATCTGCACTGGAACTTTTTGACAAAAGCAAACTGTCCATCATGGCTATCACCAGTGGACATGGATCACTATGGGGCCTGTGATTAACAGATAGCACCTCGTCTCCTCTGCTGGCAGGCCATCATTACGGAAAGCACTAGAAGATAGGTGCTGTGGGCGAAGAGACCTTTAGGAAAATGTCAGCGACAGTCAGAGCCCCAAATTTCCTCTATTCACTCTTTCCCATGTGGTGCTAAACCCCTTAATTACACAACCCCTTTCTCCTATTGCCTAATGTATCTAAATACAGTGTCTGATTTCAGAGCATGTGgagattttattttatcttatgtggctgttaaataaaaaaaataaaaaaataaacaaccaCAGCCCTATATTTCACCCATTcagcaaagaacaaaaaaacacccaGCTCCCCTTTCCCATTCCCTTTCACTGGAGCCAGCTTCTCTGTCTTAATACTCTTAGTCCCTCAGTTGGCATGTTTTCCAAGAACCCGCTGCATTTTCAGATTTGTAAAATCCGTCAAAATGCCTGTAATGAATATTTCTACAGCAGTGAGTGTATTTAATGTGCTGTGTGACCAGCCCAATAAGAAACCTGTACTGATCTGGCACTAGGAACCAGAGTGTCAGTAGACAGAAGCCTCAGTCAGTGTAGAAAACAGTATAAAACAGATAATTTTCATTCAAACACAAACCTCTTTTATCAAGAGAAGCTGAAAGGAAAACATTTATTATTCACATATCATCTTAGGTGTCAATTAACATCTTTCTAGTGCATGAAGCTACATGTCATGTATAtttctatttaaaattttcAAAAAAGAAACTACTATTAGTAATTCTTTCTTCATGGATTAACGTCATCTTGTGAGCCAGGTCTTTGTTATACACTTACAATGTATGGTGTGGTTTACTGTTGTCTTAGCTTttgctgaaaaaaaagagaaaacatattaaataaaat
>XM_039819003.1
ATGCAAATCACTCTGCTGCTAGAGCCTCATTGGCCGCTTGTGGCTCATTTAACCGCTGTCAGTGCGCATCATATGGCCGCCGCTTTAACCAACTTCTCAACTCCGACAGGGAAAGTAGCTCAATGCGCATTCTATCACAGTGGATGTGAACCAACCCATGCGAAGACTGGGTGAAATACTGTTTGAGGAACATTTTATATCCTTTTTCGGATTATTTTTTATCTCGTATTTGAATCATTTGGGTTTAGAATTTTAATTTTGCAACCAAGTCATCCTGTGCACACCCGATCTTGTGCGCGCCTATCTTGAAATATACTTTTTCTCTGTTGGTGGCGAAGTTTAAAATCAGGTGAATGTATAGCATGATGATGGAAACTGACCTCCATTCCCCTGGCCCCCAAACGAACACTACCACGGGGCAAACGGGGCCGAACAGCGGGTCCAAAGCGAATCAGGAACGGGTGAAGAGACCGATGAACGCGTTCATGGTGTGGTCCCGTGGGCAGCGGAGAAAGATGGCACAGGAGAACCCCAAAATGCACAACTCAGAGATCAGCAAACGGCTGGGCGCCGAGTGGAAGGTGATGTCCGAGGCTGAGAAGCGCCCTTTCATCGACGAGGCGAAACGGTTGCGAGCTATGCACATGAAGGAGCATCCGGATTACAAGTACCGGCCAAGACGGAAGACCAAGACGCTGCTGAAAAAGGACAAATATTCCCTTGCCGGTGGACTGCTTTCTGGCCCCAGTGGGGGCGGTGGAGTTGGTTTAGGGGTCGGAATGAGTTCATCCGGGGTCGGGCAGAGGTTAGAGAGCCCCGTGGGTCACGTAGGCTCCGCCAGCTCCGGCTATGCGCACATGAACGGCTGGGCGAACGGTGCGTACTCGGGGCAGGTGGCTGCAGCCGCGGCGGCGGCAGCGATGATGCAGGAGGCTCAGCTAGCCTACAGCCAGCACCCGGGGAGCGGAgcgcaccaccaccaccactcacACCACCATCACTCACACAACCCCCAGCCCATGCACCGCTACGACATGACAGCCCTGCAGTACAGCCCCATCTCGAACTCTCAGGGCTACATGAACACTTCTCCATCCGGCTACGGTGGGATCACCTATACACAGCACCAGGGCTCCGGCGTCTCCTCCTCAGCTGCCATGGGGACGTTAGGGTCGCTCGTGAAGTCGGAGCCGAGTatttCTCCGTTGTTGGAATAGCATCTGCACTGGAACTTTTTGACAAAAGCTTGTTccactgcccccaagtggccaaaaactTATTTATGCAGCTTTAAGTGGAGTAGCAAATAGCTGCAGCACCAGAGGCTGCACTTTCAGGACCACATTTTCAGGATCACAATTCTAGGACCCTAACTAGAGGGCGGTTCTTGCAAAGAAAGCGAGTGGGATTCATAATGACGCGACAGGACCGCATTTCCACCAAAAATGGAAGAAGAGCACGTCAAGCCAAGTGTGAGTAACATTAGCTACTGATTGTGAATAacgttatttttatttgtttaacgtTACTGTGAAAACCATTGGTGTCTTGAATTGGACAGTaaagtaacgttagctgtctAGCCGAAGTGTGAGTGAGTTAACTAACGTAAACATTCTATTAAACGTCTTAAAGTTCCTAACGTTACAATCATCgttagcttgcttgctaacTGTTAGCCCAAAGTATTTGTTTAATGAAACGTAACGCTTTAGCTATCTTTGGTGAGCCCAACTAGGTGTATTATATTACTTGTTAATTTTATGActataataattaattataGCCAGCACTTTACTTCAAAAGATATAAATGCCTGTCTTCAGTGTACGTGAATGTTCATTGTTATACCTGGGACAGGTATTTCAGGTTTTTTTACACAATGCCATGCAATCTTATCAACTATTCACTAATATAAGTTAAGACCTTAATTAAGAAGTAGTAATGAATGCATCTTTTTTAAGTGTTACCAATATGTTTATAACCCATCACTGATTATGTTGTCCACTTACTATTTCAGTGAAGTGGAGGAATTCTGCGGACGATCggaaaaacaaatgagaaaaGAAGGAAGACCACTCAATTAAAAGGTAGTGCTCTTAACTTCTTCATAATGTGGGTTTCAATATTGAAGGGTCAGGAAAAGTcaaatgttaataaaatgtCATTTCTATTGTTTTCTtgcatctctctgtctctctctctctcgcaggAATCATTGAGACTTGAAAGTGTTCGATTATGTGAAGAACAGAAACGGCAGACAAGAAATAATTAAAACTGTCATTCTCCTATTTCATGTGGAAATTGCCaaaattaaaaactttttgACTTATTtgcttgctttttttctttg
>XR_005641085.1
ATGCAAATCACTCTGCTGCTAGAGCCTCATTGGCCGCTTGTGGCTCATTTAACCGCTGTCAGTGCGCATCATATGGCCGCCGCTTTAACCAACTTCTCAACTCCGACAGGGAAAGTAGCTCAATGCGCATTCTATCACAGTGGATGTGAACCAACCCATGCGAAGACTGGGTGAAATACTGTTTGAGGAACATTTTATATCCTTTTTCGGATTATTTTTTATCTCGTATTTGAATCATTTGGGTTTAGAATTTTAATTTTGCAACCAAGTCATCCTGTGCACACCCGATCTTGTGCGCGCCTATCTTGAAATATACTTTTTCTCTGTTGGTGGCGAAGTTTAAAATCAGGTGAATGTATAGCATGATGATGGAAACTGACCTCCATTCCCCTGGCCCCCAAACGAACACTACCACGGGGCAAACGGGGCCGAACAGCGGGTCCAAAGCGAATCAGGAACGGGTGAAGAGACCGATGAACGCGTTCATGGTGTGGTCCCGTGGGCAGCGGAGAAAGATGGCACAGGAGAACCCCAAAATGCACAACTCAGAGATCAGCAAACGGCTGGGCGCCGAGTGGAAGGTGATGTCCGAGGCTGAGAAGCGCCCTTTCATCGACGAGGCGAAACGGTTGCGAGCTATGCACATGAAGGAGCATCCGGATTACAAGTACCGGCCAAGACGGAAGACCAAGACGCTGCTGAAAAAGGACAAATATTCCCTTGCCGGTGGACTGCTTTCTGGCCCCAGTGGGGGCGGTGGAGTTGGTTTAGGGGTCGGAATGAGTTCATCCGGGGTCGGGCAGAGGTTAGAGAGCCCCGTGGGTCACGTAGGCTCCGCCAGCTCCGGCTATGCGCACATGAACGGCTGGGCGAACGGTGCGTACTCGGGGCAGGTGGCTGCAGCCGCGGCGGCGGCAGCGATGATGCAGGAGGCTCAGCTAGCCTACAGCCAGCACCCGGGGAGCGGAgcgcaccaccaccaccactcacACCACCATCACTCACACAACCCCCAGCCCATGCACCGCTACGACATGACAGCCCTGCAGTACAGCCCCATCTCGAACTCTCAGGGCTACATGAACACTTCTCCATCCGGCTACGGTGGGATCACCTATACACAGCACCAGGGCTCCGGCGTCTCCTCCTCAGCTGCCATGGGGACGTTAGGGTCGCTCGTGAAGTCGGAGCCGAGTatttCTCCGTTGTTGGAATAGCATCTGCACTGGAACTTTTTGACAAAAGCTTGTTccactgcccccaagtggccaaaaactTATTTATGCAGCTTTAAGTGGAGTAGCAAATAGCTGCAGCACCAGAGGCTGCACTTTCAGGACCACATTTTCAGGATCACAATTCTAGGACCCTAACTAGAGGGCGGTTCTTGCAAAGAAAGCGAGTGGGATTCATAATGACGCGACAGGACCGCATTTCCACCAAAAATGGAAGAAGAGCACGTCAAGCCAAGTTGAAGTGGAGGAATTCTGCGGACGATCggaaaaacaaatgagaaaaGAAGGAAGACCACTCAATTAAAAGgAATCATTGAGACTTGAAAGTGTTCGATTATGTGAAGAACAGAAACGGCAGACAAGAAATAATTAAAACTGTCATTCTCCTATTTCATGTGGAAATTGCCaaaattaaaaactttttgACTTATTtgcttgctttttttctttg

Function


NR:

description
PREDICTED: transcription factor Sox-1a isoform X2

GO: NA

KEGG:

id description
K09267 SOX1S; transcription factor SOX1/3/14/21 (SOX group B)

RNA


RNA id representative length rna type GC content exon number start site end site
XM_039819002.1 False 2188 mRNA 0.47 3 21619349 21634757
XM_039819003.1 True 2356 mRNA 0.46 3 21619348 21627452
XR_005641085.1 False 1686 transcript 0.51 5 21619348 21627452

Neighbor


gene id symbol gene type direction distance location
arhgef7a arhgef7,LOC103372297,LOC102301399,LOC106582025 coding upstream 69313 21518502 ~ 21550035 (+)
mrpl30 mrpl30,LOC102785580 coding upstream 147680 21468365 ~ 21471668 (+)
si:dkey-4e7.3 LOC103368133 coding upstream 212447 21398530 ~ 21406901 (+)
inpp4aa inpp4a,LOC106581879 coding upstream 232792 21369820 ~ 21386556 (+)
mfsd9 mfsd9 coding upstream 266691 21345532 ~ 21352657 (+)
mcf2lb LOC100707473,LOC101480000,LOC102199374,LOC103372300 coding downstream 57464 21692221 ~ 21742235 (+)
phf11 NA coding downstream 108991 21743748 ~ 21755526 (+)
trim13 trim13 coding downstream 158517 21793274 ~ 21809912 (+)
rnaseh2b rnaseh2b coding downstream 213894 21848651 ~ 21854501 (+)
LOC120570102 LOC103362603,LOC104951867 coding downstream 226503 21861260 ~ 21863025 (+)
G126000 NA non-coding upstream 137323 21479929 ~ 21482025 (+)
G125994 txndc9 non-coding upstream 142598 21473163 ~ 21476750 (+)
G125992 mitd1,LOC102785871 non-coding upstream 151216 21465910 ~ 21468132 (+)
G125958 NA non-coding upstream 209383 21409192 ~ 21409965 (+)
G125957 NA non-coding upstream 210235 21407513 ~ 21409113 (+)
G126160 LOC103371890 non-coding downstream 716270 22351027 ~ 22352358 (+)
G126162 LOC103371890 non-coding downstream 721337 22356094 ~ 22357669 (+)
G126169 NA non-coding downstream 750772 22385529 ~ 22385754 (+)
G126175 NA non-coding downstream 810571 22445328 ~ 22446121 (+)
G126246 NA non-coding downstream 900374 22535131 ~ 22537513 (+)
G125757 rpl24 other upstream 753906 20860152 ~ 20865442 (+)
mpc2b mpc2 other upstream 760432 20853089 ~ 20858916 (+)
G125627 LOC108259060,LOC105016494,LOC108266561,LOC104955100 other upstream 1098074 20492178 ~ 20521274 (+)
myo16 myo16,LOC103372229 other upstream 1353758 20153855 ~ 20265590 (+)
LOC120569357 LOC103369896,LOC102289365,LOC102197623,LOC101484406 other upstream 1626024 19968360 ~ 19993324 (+)
LOC120569986 LOC103372298,LOC100698342,LOC102199663,LOC102299751,LOC101469554,LOC102782232 other downstream 20369 21655126 ~ 21691804 (+)
fbxl3a fbxl3 other downstream 1096166 22730923 ~ 22738653 (+)
G126363 lmo7 other downstream 1172436 22807193 ~ 22815102 (+)
dachd LOC107389725,LOC105927577,LOC103357501,LOC103134744 other downstream 1343845 22978602 ~ 23100083 (+)
cwc22 cwc22 other downstream 2549192 24183949 ~ 24199835 (+)

Expression



Co-expression Network