I haven't previously used this blog to announce courses per se, but seeing as folks are paying attention and time is getting short, I would like to bring to your attention the following:

The Kansas State University bioinformatics curriculum committee has been working on a couple of undergraduate-level courses on computational methods (particularly algorithms, numerical analysis, and visualization techniques) for biological applications. Some faculty in the College of Engineering have expressed interest in having such a course for graduate students. The following is a 6-week course on the basics of data mining for computational biology, to be offered in June and July. We would appreciate it if you would pass this along to any interested students (and if you are a K-State student reading this and are interested in registering, please send me e-mail at hsuwh[AT]hotmail.com).

CIS 690 - Data Mining in Bioinformatics

Semester hours: 3
Reference number: 07560
Dates: Wed 01 Jun 2005 - Fri 15 Jul 2005
Time: MTWUF, 14:30 - 16:00
Format: 50 minutes lecture, 35 minutes lab daily
Venue: 236 Nichols Hall

Course Description: This 6-week course covers fundamentals of data modeling and mining with an emphasis on applications in computational biology, and will be of interest to the undergraduate or graduate student in science, engineering, mathematics, or statistics who is seeking background in basic data mining techniques. Topics to be covered include fundamentals of machine learning, pattern recognition, Bayesian methods, development and application of relational databases, and visualization of data and clustering output. Programming background at the level of a first course in computer science is required; no other background in mathematics, molecular biology or genetics is assumed. This course will emphasize analysis of sequence data and gene expression data, but students with other interests in data mining are welcome to enroll and may select other project topics.


Prerequisites:

  • Required: first course in programming (CIS 200 or equivalent)

  • Recommended: first course in probability and statistics (STAT 510 or 410)

Grading: 20% midterm, 20% homework (2 assignments), 10% paper reviews, 50% project
Textbook: Data Mining (2000) by Witten and Frank


Course outline:

  • Data in bioinformatics (throughout course)

    • Microarrays

    • Sequence data: protein and nucleotide sequences

    • Expressed Sequence Tags (ESTs) and tag libraries

    • Sources of data: GenBank, PDB/SwissProt, Stanford Microarray Database; PubMed

  • Problems in computational biology (2 lectures intro; throughout course)

    • Modeling gene networks and pathways

    • Biochemical pathways and signal transduction

    • Protein-protein interactions

    • Protein secondary and tertiary fold prediction

    • Phylogenetic modeling

  • Fundamentals of machine learning (1 week)

    • Supervised inductive learning algorithms: Apriori (association rules), decision trees, Naive Bayes

    • Relevance determination and feature selection

  • Supervised machine learning algorithms for bioinformatics (1.5 weeks)

    • Sequence learning: hidden Markov models (HMMs)

    • Bayesian networks: structure learning and parameter estimation

    • Kernel methods: maximum margin and support vector machines

    • Minimum description length (MDL) methods

  • Clustering (1 week)

    • k-means clustering

    • Hierarchical agglomerative clustering

    • Biclustering approaches

    • Advanced clustering methods: PCA/ICA, Kohonen's SOM

    • Applications to bioinformatics: clustering microarray data

  • Relational data mining (1.5 weeks)

    • Fundamentals of relational databases

    • Structured Query Language: SELECT, JOIN, PROJECT

    • OLAP

    • Database organization: star and constellation

    • Probabilistic relational models (PRMs)

    • Text mining fundamentals

    • Data modeling in bioinformatics

  • Visualization (3 lectures)

    • Data and information visualization: scatterplots, evidence visualization

    • Output: Naive Bayes, decision trees and graphs; clusters and clustering trees

    • Survey of 3-D modeling

If you are interested in taking this course yourself or know someone qualified and interested, please do spread the word and let them know especially that the emphasis is on bioinformatics (a departure from previous offerings).



( 6 comments — Leave a comment )
May. 22nd, 2005 06:42 am (UTC)
:P I'd take it if I went to KSU, but I don't :(. Instead, I get to enjoy having my old TA from Java teach me a class in a summer session.
May. 22nd, 2005 02:21 pm (UTC)
I'm not sure about this particular course since it has a lab, but they do offer many courses on the internet...
May. 22nd, 2005 06:56 pm (UTC)
I should do that, shouldn't I?
I do publish my courses using Tegrity, as I will with this one. Perhaps I should also record a set of lectures and use MIT OpenCourseWare or O'Reilly's SafariU to distribute them. What do you think?

(Tegrity requires some codecs, but all the Java components can be downloaded from our server.)

May. 22nd, 2005 07:44 pm (UTC)
Re: I should do that, shouldn't I?
Ooooo, spiffy! I'd love to hear one or two.

As for actually taking the class over the 'net...um... don't think I need to do that ;) my school's got decent lectures....maybe if I could get them for supplementary learning, hehehe.
May. 22nd, 2005 08:47 pm (UTC)
Re: I should do that, shouldn't I?
I think about taking online courses in addition to any courses where I'll go to a local community college campus. UT doesn't really offer a whole lot of classes where you can earn credit by completing the coursework online, but I bet they'd accept credits from other institutions like K-State. CIS730 costs $1275!

I'd say if students sign up for it, why not?
May. 22nd, 2005 01:51 pm (UTC)
I'd be so in for that if it weren't for the fact that I'm doing bioinformatics at Case Western