Emilysimso Week 12

From LMU BioDB 2015
Jump to: navigation, search

Goals

  • Normalize data, statistical analysis of raw data

File Notes

  • Will work on all "F" files
  • 52 chips total - 104 files (one for Cy3, one for Cy5)
  • sdrf file has 208 - duplicates
  • Need columns G, K, L, H
  • Log2 (Cy5 signal median-Cy5 signal background / Cy3 signal median - Cy3 signal background)

Files

Choosing Appropriate Columns

  • Downloaded each raw data file
  • Opened it using Xcel
  • Deleted unnecessary information at the top of the raw data tab
  • Created new tab with "Needed Columns" label
  • Transferred columns G, H, K, and L to this tab
  • Saved the file as a .xlsx file and uploaded to the wiki


Consolidating Files

  • Combined the Cy3 and Cy5 files for each time point onto a single spreadsheet
  • In each new spreadsheet:
    • Sheet 1 = Cy3
    • Sheet 2 = Cy5
    • Sheet 3 = Combined
      • Added _Cy3 to columns A-D and _Cy5 to columns E-H on row titles to differentiate the samples
  • Calculated the Log2 for each of the following files in Sheet 4
    • Copied over all of the data from Combined sheet
    • Column I labeled (Cy5 signal median - Cy5 background median)
      • I2=G2-H2
    • Column J labeled (Cy3 signal median - Cy3 background median)
      • J2=C2-D2
    • Column K labeled (Cy5/Cy3)
      • K2=I2/J2
    • Column L labeled Log2
      • performed =LOG(K2, 2)



Weekly Assignment Information

User: Emilysimso

Assignments

Individual Journal Entries

Class Journal Entries

Group Project

Heavy Metal HaterZ