Stacy Hung
Co-op Student

Contact information
sshung@students.cs.waterloo.ca

Education
September 2002-present
B.Sc. Honours Bioinformatics and Biology
University of Waterloo


 

Research Projects

Fall Term (September-December 2005)

1)Refactoring of Pathology Input Tool
Due to minor modifications that were made to the pathology database tables, I refactored the Pathology Input Tool to reflect these changes.

2)Adding New Functionality to TCT
For the primary purpose of being able to further visualize and interpret coactivity, I refactored the TCT to support a new functionality called the coactivity interpretation tool (CAIT). CAIT is based on the concept of coactivity for an ontological pair. The TCT now allows the coactivity, displacement, and average activity indices to be displayed as a heat map by overlapping with the original TCT plot.

3)Tree Map
My largest project for the term, the TreeMap, is a visualization for hierarchical data. The Tree Map takes (pathology) data from either dbZach or an input file and parses the data into a tree-like structure. I have developed and implemented the recursive algorithms for three layouts: Clustering, Slice-and-Dice, and Squarified.

4)Replication of Yan Sun's DRE Search
This ongoing project is to continue replicating an experiment done by a previous post-doc, Yan Sun. The experiment involved searching for dioxin response elements (DREs) in Human, Mouse, and Rat genomic sequence. I have created a small Java application that finds and reports DRE occurrences in 20 000 15 000 bp sequences generated using a Mersenne Twister random number generator. I have replicated some of the bioinformatics components of the experiment, including statistical analyses in R, and analysing the occurrence of DREs in orthologous genes requiring extensive querying of data from Ensembl and HomoloGene.

5)Software Comments Interface
This project is a Java WebSerlvet to allow members of the lab to submit comments about in-house software. The interface requires the incorporation of Java’s Web Development kit, JDBC, as well as SQL queries to insert software comments into dbZach.

Winter Term (January-April 2005)

1)Pathology Input Tool
This first project is a Java application that allows pathology data produced in the lab to be inputted into dbZach. Implementation required the leveraging of Swing, OJDBC, and JExcel libraries. Through SQL queries, the Pathology Input Tool inserts pathology lab and histology data in the form of a text file or Excel spreadsheet.

2)Toxicogenomics Correlation Tool (TCT)
This second project is a visualization tool to provide a means of interpreting data from cross-comparison studies, such as that for mouse and rat orthologues. This Java application reads raw gene expression and activity data text files, calculates the correlation data, and plots the correlation data. The TCT supports correlation filters, and data output for user-selected points on the plot.

3)K-Means and Improved K-Means Clustering
This third project involves two implementations of K-Means clustering visualizations. The first implementation is essentially the classical K-Means algorithm which supports the Euclidean, Manhattan, Minkowski, and correlation distance metrics. The second implementation, called the Improved K-Means, builds on the first by performing a specified number of iterations of the classical K-means. Using these iterations, the Improved K-Means determines how often each gene occurs in the same cluster, and displays this information as a linkage plot.

4)Electrophoresis gels
Throughout the Winter term, I performed many electrophoresis gels for Jeremy’s Primer project. I also did RNA gels for Cora, and verification of restriction enzymes for Darrell.