UCSC Genome Bioinformatics

Downloading Data using MySQL

We have a MySQL database for public access at genome-mysql.cse.ucsc.edu.

This server allows MySQL access to the same set of data currently available on our public Genome Browser site. The data are synchronized weekly with the main databases on our public site. During synchronization, the MySQL server can be intermittently out of sync with the main website for a short period of time. The weekly synchronization takes place on Monday mornings from 4:00 am to 9:00 am Pacific Time (GMT -7:00).

Connecting

You must have MySQL client libraries installed on your computer. We recommend using the most current version of the v5.0 MySQL clients.

Connect to the MySQL server using the command:

mysql --user=genome --host=genome-mysql.cse.ucsc.edu -A

The -A flag is optional but is recommended for speed.

Once connected to the database, you may use a wide range of MySQL commands to query the database.

Conditions of Use

  • Avoid excessive or heavy queries that may impact the server performance. Inappropriate query use will result in a restriction of access. If you plan to execute a query that you think may be excessive, contact UCSC first to avoid the possibility of having your access blocked.
  • Bot access and excessive program-driven use are not permitted.
  • Attachments by local mirror sites are prohibited.

Using the MySQL Server with our Utilities

The MySQL database can also be used by the numerous utilities in the Genome Browser source tree. Some of these utilities require a password, so you will need to add the following specifications to your $HOME/.hg.conf file (remember to chmod your .hg.conf file to 600 permissions):

db.host=genome-mysql.cse.ucsc.edu
db.user=genomep
db.password=password
central.db=hgcentral

If you prefer a graphical user interface (GUI) to the UCSC database tables, use the Table Browser.

System problems should be reported to genome-www@soe. ucsc. edu. Send questions regarding the database contents or queries to genome@soe. ucsc. edu. Messages sent to this address will be posted to the moderated genome mailing list, which is archived on a SEARCHABLE PUBLIC Google Groups forum.