This project requires reading documents, following the instructions in them, and some coding in R, a free statistical programming environment available on the internet.
This project will probably be paid by the hour. Examples of questions:
a) "p16" is a known tumor suppressor gene involved in cell cycle regulation. Use the NCBI database to confirm that its official gene symbol is "CDKN2A". What other aliases does this gene have? What is the Entrez ID of this gene? (Print the page where you find the information.)
b) Use Map Viewer to retrieve the sequence of the "CDKN2A" gene (including the 2 kbp upstream and downstream regions) in Homo sapiens (human) and save it as a text file named "[url removed, login to view]". How long (in base pairs) is this sequence?
c) Use Bioconductor to retrieve the sequence of "CDKN2A" in the same way, with 2 kbp upstream and downstream. How long is this sequence? (There should be two alternative splicing forms of this gene. Pick the one identical to the sequence from the NCBI database.)
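To give a sense of the kind of R coding involved, questions b) and c) both end by asking for a sequence length. Below is a minimal sketch in base R, not a required solution. It assumes the sequence was saved as FASTA-style text (a header line starting with ">" followed by sequence lines). The function names, the toy record, the coordinates and the file name "sequence.txt" are all made up for illustration.

```r
# Count the bases in FASTA-style text: drop header lines that start
# with ">", strip whitespace, and count the remaining characters.
fasta_length <- function(lines) {
  seq_lines <- lines[!startsWith(lines, ">")]
  sequence <- gsub("\\s", "", paste(seq_lines, collapse = ""))
  nchar(sequence)
}

# Expected length of a gene region with equal flanks on both sides,
# given 1-based inclusive start and end coordinates.
flanked_length <- function(start, end, flank = 2000) {
  (end - start + 1) + 2 * flank
}

# Toy record (illustrative only, not real CDKN2A sequence).
toy <- c(">example_record", "ACGTACGTAC", "GTTAGC")
fasta_length(toy)          # 16

# Hypothetical coordinates: a 100 bp gene plus 2 kbp on each side.
flanked_length(101, 200)   # 4100

# For a saved file, read it first ("sequence.txt" is a placeholder name):
# fasta_length(readLines("sequence.txt"))
```

Comparing the counted length against the length expected from the gene coordinates plus the two 2 kbp flanks is one simple way to check that the NCBI and Bioconductor sequences cover the same region.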
The assignment requires reading several documents, which will take between 1 and 4 hours.
Please indicate which questions you are comfortable answering and which you are not comfortable working on.