If you are interested in participating this Coordination Action please contact us!
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
public:hack4 [2013/08/26 12:07] – andreas | public:hack4 [2019/02/12 09:04] (current) – external edit 127.0.0.1 | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====Hackathon | + | ====Hack-a-thon |
+ | ==19 - 21 September 2013 in Amsterdam, the Netherlands== | ||
+ | \\ | ||
[[public: | [[public: | ||
[[public: | [[public: | ||
- | TC Leader: Oren Tzfadia and Erik Alexandersson | + | {{:public: |
+ | Go to Google maps. Find Watergraafsmeer in Amsterdam and in Watergraafsmeer the Science Park. In the top right corner of the triangular half of the science park you find a boomerang-shaped building. That's it. | ||
+ | |||
+ | SARA and the eScience Centre share the building. Google places SARA at the location where we will get a room for the hack-a-thon, | ||
+ | |||
+ | |||
+ | |||
+ | [[http:// | ||
+ | Eerste Ringdijkstraat 4 | ||
+ | 1097 BC Amsterdam | ||
+ | |||
+ | TC Leaders: Oren Tzfadia and Erik Alexandersson | ||
+ | |||
+ | **Participants: | ||
+ | Monika Brandt\\ | ||
+ | Agnieszka Danek (Silesian University of Technology, Gliwice, Poland)\\ | ||
+ | Estelle Wera (SLU)\\ | ||
+ | Itziar Frades (SLU)\\ | ||
+ | Didi Amar (Weizmann Institute)\\ | ||
+ | Tatyana Goldberg (Technische Universitaet Muenchen)\\ | ||
+ | Erik Alexandersson (SLU)\\ | ||
+ | Oren Tzfadia (Weizmann Institute)\\ | ||
+ | Sanjeev Kumar Sharma (James Hutton Institut)\\ | ||
+ | Gregoire Rossier (SIB) | ||
+ | |||
+ | |||
+ | It will start Thursday Sept 19 at 12:00 at SARA. | ||
+ | |||
+ | We will end Saturday with lunch (around 13). | ||
+ | |||
+ | **Provisional program, schedule is flexible** | ||
+ | |||
+ | ** | ||
+ | //Wednesday 18 September// | ||
+ | Arrival of first participants\\ | ||
+ | |||
+ | **// | ||
+ | morning: | ||
+ | 12: | ||
+ | 12: | ||
+ | **13:00 - 18: | ||
+ | 13: | ||
+ | 13: | ||
+ | 13: | ||
+ | 14: | ||
+ | 14: | ||
+ | 16: | ||
+ | 16: | ||
+ | 17: | ||
+ | 18: | ||
+ | 20: | ||
+ | |||
+ | **//Friday 20 September// | ||
+ | < | ||
+ | 8:30: Taxi to SARA\\ | ||
+ | **9:00 - 12: | ||
+ | 9:00: Refine scripts after analysis of 1st round results. Re-running the validation score. Define ‘stable’ scores and scoring schemes\\ | ||
+ | 10: | ||
+ | 11: | ||
+ | 12: | ||
+ | **13:30 - 18: | ||
+ | 13: | ||
+ | 16: | ||
+ | 17: | ||
+ | 18: | ||
+ | 19:30 - 20:30 Free slot if needed\\ | ||
+ | 20: | ||
+ | |||
+ | // | ||
+ | < | ||
+ | 8:45: Check out\\ | ||
+ | **9:00 - 12: | ||
+ | 9:00: Recap video conference with Kate\\ | ||
+ | 9:45: Hackathon summary and perspectives\\ | ||
+ | 12: | ||
+ | 13: | ||
+ | |||
+ | |||
+ | |||
+ | **Overview** | ||
+ | |||
+ | For non-model organisms, genes predicted in the sequenced genome are relatively poorly functionally annotated. Instead, researchers have to rely on information derived from sequence identity to model organisms. | ||
+ | |||
+ | We aim to gather information from several complete sequenced plant genomes and bind/modify existing tools and pipelines for efficiently analyzing large scale ‘-omics data’. By that, we seek to generate a robust and automated framework to assign genes into functional categories, and classify them in a biological context such as biological pathways. | ||
+ | |||
+ | |||
+ | Another challenge is to compare, link and annotate transcript sequences derived from RNA-seq of not yet sequenced genomes with already sequenced genomes. The potato genome was sequenced last year and it is time to ripe for genome comparisons, | ||
+ | |||
+ | Currently, the OrthoMCL clusters are used for gene family analysis together with BLAST. Visualisation of ‘-omics’ data has been done in a commercial software, QluCore, but this does not handle multiple data types simultaneously well and does not visualise functional pathways. Gene predictions were done ab initio with parameters trained for A. thaliana and also based on sequence similarity with four other plant genomes. Functional annotation of predicted genes was done by identifying orthologous and paralogous gene families in 12 sequenced plant species by OrthoMCL. | ||
+ | |||
+ | Available data: RNA-seq (GSII Illumina pair.end reads) exist for 3 different potato cultivars and 3 wild potato species. In addition, gene expression data (Agilent microarray based on the 3.4 version of the potato genome) and secretome quantitative proteomics data from various states exist (all samples are from leaves). | ||
+ | |||
+ | |||
+ | |||
+ | **Rough Sketch of the Plan before hack-a-thon: | ||
+ | |||
+ | * Share and review project “blue print” by all Hack-a-thiners. | ||
+ | * Data collection. | ||
+ | * Building virtual “warehouse” of relevant existing tools. | ||
+ | * Comprehensive related literature survey. | ||
+ | * Data pre-processing and protocols set up. | ||
+ | * Simulations of “dry” (practice) runs for optimizing “wet” (real) runs to be performed at the time of the hack-a-thon meeting in Amsterdam. | ||
+ | * Define computational needs (computer clusters, memory usage, and required packages installments). | ||
+ | |||
+ | |||
+ | During the hack-a-thon, | ||
+ | - Trinotate | ||
+ | - Blast2GO | ||
+ | - PotatoCyc | ||
+ | - OrthoMCL / InParanoid | ||
+ | - Phytozome | ||
+ | - KEGG | ||
+ | |||
+ | Two things are CRUCIAL for us to complete BEFORE going to Amsterdam: 1) Collecting outputs from each pipeline. 2) Parse the output of each pipeline to get a tab delimited file that looks like so: GeneID (Potato ID) Term (GO term ID OR E.C ID) Score (if available p-value/ |