HSF1, Our Favorite Yeast Gene

(Saccharomyces cerevisiae)

Arash Lari & Mary Balducci

SGD

Standard Name: HSF1
Systematic name: YGL073W
Name Description: Named because it codes for one of two Heat Shock transcription Factors

Website
ID#

DNA Sequence:


   1 ATGAATAATG CTGCAAATAC AGGGACGACC AATGAGTCAA ACGTGAGCGA TGCTCCCCGT
  61 ATTGAGCCTT TACCAAGCTT GAATGATGAT GACATTGAAA AAATCTTACA ACCGAACGAT
  121 ATCTTTACGA CCGATCGTAC CGATGCAAGT ACTACATCTT CCACAGCCAT TGAAGATATT
  181 ATTAACCCCT CATTGGATCC GCAGTCAGCA GCATCGCCGG TTCCTTCTTC CTCTTTTTTC
  241 CATGACTCAA GGAAACCTTC CACCAGTACA CATTTAGTAA GGAGAGGTAC TCCATTGGGA
  301 ATTTACCAAA CCAATCTATA CGGTCACAAT AGCAGAGAAA ATACTAATCC TAATAGTACA
  361 TTATTATCTT CTAAGTTACT CGCGCATCCA CCAGTTCCTT ATGGGCAAAA TCCCGATTTA
  421 CTACAACATG CTGTGTACAG GGCACAGCCG TCAAGTGGAA CCACTAACGC GCAACCGCGC
  481 CAAACCACAA GAAGATATCA ATCCCATAAA TCACGGCCTG CATTTGTTAA TAAACTATGG
  541 AGCATGTTAA ACGATGATTC TAATACGAAA CTTATACAGT GGGCGGAGGA TGGAAAATCT
  601 TTTATTGTCA CGAATAGGGA GGAATTTGTG CACCAAATTT TACCAAAATA TTTTAAACAT
  661 TCCAATTTCG CTTCCTTTGT AAGACAATTG AACATGTATG GATGGCATAA AGTTCAAGAT
  721 GTCAAGTCAG GATCAATTCA AAGTAGTTCA GATGATAAGT GGCAATTTGA AAATGAAAAC
  781 TTCATTAGAG GTAGAGAAGA TTTGCTGGAA AAAATAATCA GGCAGAAAGG TTCCTCCAAT
  841 AACCATAATA GCCCTAGTGG TAACGGTAAT CCAGCGAATG GTAGCAACAT CCCTCTGGAC
  901 AATGCCGCAG GAAGTAATAA TAGCAATAAT AACATCAGTA GTAGTAATTC ATTTTTTAAC
  961 AATGGTCATT TATTGCAGGG TAAAACACTA AGATTAATGA ACGAAGCGAA TCTTGGAGAT
  1021 AAGAATGATG TCACCGCGAT TTTGGGGGAA TTAGAGCAAA TAAAATATAA CCAGATTGCA
  1081 ATTTCCAAAG ATTTACTAAG AATAAACAAA GATAATGAGT TATTATGGCA AGAGAATATG
  1141 ATGGCCAGGG AAAGACATAG AACCCAACAG CAAGCCTTGG AAAAAATGTT CAGATTCTTG
  1201 ACATCTATAG TCCCACACTT AGATCCCAAA ATGATTATGG ACGGGCTGGG AGATCCGAAA
  1261 GTTAATAATG AAAAGCTAAA CAGTGCGAAT AACATTGGGT TAAATCGCGA CAACACAGGC
  1321 ACTATAGATG AACTAAAATC CAACGATTCT TTCATAAACG ATGATCGTAA TTCTTTCACC
  1381 AATGCTACAA CCAACGCCCG TAATAACATG AGTCCCAACA ATGATGACAA TAGTATTGAC
  1441 ACCGCTAGCA CTAATACCAC CAACAGAAAG AAAAATATAG ATGAAAACAT CAAAAATAAC
  1501 AACGACATAA TTAATGACAT TATATTTAAT ACCAACCTTG CCAACAATCT CAGCAATTAC
  1561 AATTCCAACA ATAATGCTGG CTCGCCAATA AGGCCCTATA AACAAAGATA TCTTTTGAAA
  1621 AATAGAGCCA ATTCCTCGAC ATCGAGTGAG AATCCAAGCC TAACGCCCTT TGATATCGAA
  1681 TCTAATAATG ACCGCAAAAT TTCAGAAATT CCTTTTGATG ACGAAGAAGA AGAAGAAACG
  1741 GATTTTAGGC CTTTTACCTC GCGAGATCCT AATAACCAAA CGAGTGAAAA CACTTTTGAT
  1801 CCAAACAGAT TTACGATGCT CTCTGATGAT GATTTAAAAA AAGATTCTCA TACCAATGAC
  1861 AATAAACACA ACGAAAGTGA TCTTTTTTGG GACAACGTAC ATAGAAATAT AGACGAACAA
  1921 GATGCAAGAC TCCAGAACTT GGAAAATATG GTTCACATAC TTTCTCCTGG ATATCCTAAT
  1981 AAGTCGTTCA ACAACAAAAC TTCCTCGACA AACACTAATT CCAATATGGA AAGTGCTGTC
  2041 AACGTTAATA GCCCTGGTTT CAACTTACAG GATTATTTAA CTGGAGAGTC TAATTCCCCC
  2101 AATTCTGTTC ATTCTGTTCC CTCCAATGGC AGCGGCTCCA CACCGTTGCC CATGCCAAAT
  2161 GATAATGACA CCGAGCACGC AAGTACAAGT GTCAATCAAG GCGAAAATGG AAGCGGATTA
  2221 ACGCCCTTCC TCACGGTAGA TGATCACACA CTAAACGACA ATAACACTAG TGAGGGAAGT
  2281 ACAAGGGTGT CCCCCGATAT AAAGTTCAGC GCCACTGAAA ACACTAAAGT GAGTGATAAC
  2341 CTGCCAAGCT TTAATGACCA CAGTTATTCC ACCCAGGCCG ACACGGCGCC CGAGAACGCT
  2401 AAGAAAAGAT TTGTGGAGGA AATACCGGAA CCGGCTATAG TCGAAATACA GGACCCGACA
  2461 GAGTACAACG ATCACCGCCT GCCCAAACGA GCTAAGAAAT AG

Protein Sequence


  1 MNNAANTGTT NESNVSDAPR IEPLPSLNDD DIEKILQPND IFTTDRTDAS TTSSTAIEDI
  61 INPSLDPQSA ASPVPSSSFF HDSRKPSTST HLVRRGTPLG IYQTNLYGHN SRENTNPNST
  121 LLSSKLLAHP PVPYGQNPDL LQHAVYRAQP SSGTTNAQPR QTTRRYQSHK SRPAFVNKLW
  181 SMLNDDSNTK LIQWAEDGKS FIVTNREEFV HQILPKYFKH SNFASFVRQL NMYGWHKVQD
  241 VKSGSIQSSS DDKWQFENEN FIRGREDLLE KIIRQKGSSN NHNSPSGNGN PANGSNIPLD
  301 NAAGSNNSNN NISSSNSFFN NGHLLQGKTL RLMNEANLGD KNDVTAILGE LEQIKYNQIA
  361 ISKDLLRINK DNELLWQENM MARERHRTQQ QALEKMFRFL TSIVPHLDPK MIMDGLGDPK
  421 VNNEKLNSAN NIGLNRDNTG TIDELKSNDS FINDDRNSFT NATTNARNNM SPNNDDNSID
  481 TASTNTTNRK KNIDENIKNN NDIINDIIFN TNLANNLSNY NSNNNAGSPI RPYKQRYLLK
  541 NRANSSTSSE NPSLTPFDIE SNNDRKISEI PFDDEEEEET DFRPFTSRDP NNQTSENTFD
  601 PNRFTMLSDD DLKKDSHTND NKHNESDLFW DNVHRNIDEQ DARLQNLENM VHILSPGYPN
  661 KSFNNKTSST NTNSNMESAV NVNSPGFNLQ DYLTGESNSP NSVHSVPSNG SGSTPLPMPN
  721 DNDTEHASTS VNQGENGSGL TPFLTVDDHT LNDNNTSEGS TRVSPDIKFS ATENTKVSDN
  781 LPSFNDHSYS TQADTAPENA KKRFVEEIPE PAIVEIQDPT EYNDHRLPKR AKK*

Summary of the Gene

The HSF1 gene is a transcription factor, meaning it is a gene that regulates the expression of other genes. Specifically, HSF1 regulates genes in response to stresses such as heat, pH imbalance, starvation, or other stressors. HSF1 regulates a large range of genes which are involved in things like protein folding, metabolism, protein transport, etc. in order to protect the cell during while in stress. This gene is in yeast, but it is also in other organisms including mammals, birds, and plants. In yeast, the removal of this gene is fatal and mutation of it usually results in defects of the cell.This gene encodes a transcription factor in response to stress, especially heat shock related stress. This gene is necessary for the yeast’s survival. In cells which are not under stress, the gene is not active, but if the cell becomes under stresses such as heat, pH, or starvation, it becomes active to regulate the transcription of other genes, which work to protect the organism.

More Information

Differences:

The Saccharomyces Genome Database by far had the most information on this gene compared to the other three databases. The SGD website says that HSF1 is inactive in normal conditions, but the UniProt website says that this gene is necessary for normal growth. The NCBI Gene website has the same information as the SGD site, with little or no differences. The Ensembl website also does not have very much information on the function of the gene, what it did have was the same as the SGD website. I could not find anything under “Gene Expression” on this website.

Presentation:

The SGD website presented the information very clearly. The tabs at the top as well as the side menu made it easy to navigate each section of the website to find the information I was looking for. The UniProt website was also pretty simple, it had a list on the side where I could choose what I wanted to see, which also made finding information easy. The NCBI website also had a table of contents where I could click to which section i wanted, making it easy to find their information on the gene’s function. The Ensembl website also had a menu, but it was slightly less simple. Some of the categories in the menu did not come up with any results when I clicked them.

Why did we choose this gene?

We chose this gene by reading through the blog posts on the SGD website. They mentioned that this one was used in cases of heat stress. I think it’s interesting that there are genes that are prepared to regulate other genes in response to high levels of heat or other stressors, so I chose this one. My partner agreed that this was interesting and we both wanted to learn more.


UniProt

Loading…

YeastMine (SGD)

Loading…
Screenshot of bread
An image of bread to represent yeast, as there were no pictures of hsf1 that were ok to use