Introduction
An IGB user contacted us with a request to add a new genome to the IGB QuickLoad system - the Xenopus tropicalis genome assembly dated November, 2009, also called JGI 4.2/xenTro3. (See: http://genome.ucsc.edu/cgi-bin/hgGateway?hgsid=262072157&clade=vertebrate&org=X.+tropicalis&db=0).
Because this genome is supported at the UCSC Genome Browser, adding it to the system should be fairly straightforward. All we need to do is download the data, format it, and move it onto the main IGB QuickLoad site currently hosted at UNC Charlotte, in the Bioinformatics and Genomics Department's server room.
Methods
Step one is to download the sequence data.
The UCSC Web site provides this page that lists the different organisms supported in their system. I followed the link to the X. tropicalis genome and clicked the link labeled Full data set, which took me to the "bigZips" page here: http://hgdownload.cse.ucsc.edu/goldenPath/xenTro3/bigZips/.
Fortunately, it appears that UCSC already provides the fasta sequence data in 2bit format, saving me a step.
First, I added a new directory for this genome to my checked-out copy of the IGB QuickLoad subversion repository, like so:
$ svn mkdir X_tropicalis_Nov_2009
A X_tropicalis_Nov_2009
$ svn ci X_tropicalis_Nov_2009 -m "Adding new genome for frog; this is the same genome as JGI 4.2/xenTro3"
Adding X_tropicalis_Nov_2009
Committed revision 437.
Next, I changed in the new directory and then downloaded this 2bit file using wget in the usual way:
Once the file downloaded, I used twoBitInfo to make a "genome.txt" file reporting the names of the assembled chromosomes and contigs and their sizes.
To be continued.....