Notes from eScience 2008

Last week, I had the opportunity to give two talks in Indianapolis: one at the IEEE eScience 2008 conference, and another at the co-located Microsoft eScience Workshop.

All the presentations were recorded and will soon be available online.

The event brought together a very diverse community, but managed to remain remarkably focused on the core research: new platforms for data-intensive science.

Key themes

Cloud architectures
What do we need to make commercial cloud offerings suitable for science? For example, interconnect speeds are not usually included in SLAs, which poses a problem for tightly-coupled parallel science apps such as our own ocean circulation models. Are clouds fractal? That is, will there always be smaller, local clouds at individual institutions, or will Watson turn out to be right? Is it cloud or cloud + client? (I think a local presence is necessary.)
Cloud programming models
MPI, Workflow, Relational Algebra, MapReduce, and Microsoft's Dryad are all points along a spectrum of parallel programming abstractions for manipulating massive datasets. What are the limitations in performance and expressiveness with respect to specific domain applications?
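
For a concrete reference point on that spectrum, here is a toy MapReduce in plain Python (my own sketch for these notes, not any particular framework's API). The skeleton makes the map, shuffle, and reduce phases explicit, with word count as the usual example.

    from itertools import groupby

    def map_reduce(inputs, mapper, reducer):
        # Map phase: emit (key, value) pairs from every input record.
        pairs = [kv for record in inputs for kv in mapper(record)]
        # Shuffle phase: bring equal keys together.
        pairs.sort(key=lambda kv: kv[0])
        # Reduce phase: fold each key's values into a single result.
        return {key: reducer(key, [v for _, v in group])
                for key, group in groupby(pairs, key=lambda kv: kv[0])}

    # The canonical example: word count.
    lines = ["the quick brown fox", "the lazy dog"]
    counts = map_reduce(lines,
                        mapper=lambda line: [(w, 1) for w in line.split()],
                        reducer=lambda word, ones: sum(ones))
    print(counts)  # {'brown': 1, 'dog': 1, 'fox': 1, 'lazy': 1, 'quick': 1, 'the': 2}
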
Visualization
As data volumes explode, visualization is no longer a luxury but a necessity. There is simply no other way to convey the details of large datasets except by harnessing the high bandwidth of the human visual system. However, visualization alone is not enough. There is no effective method for visualizing high-dimensional data (50+ dimensions), so visualization techniques must be combined with dimension-reduction techniques such as multi-dimensional scaling or PCA. This area is referred to as visual analytics.
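
As a small illustration of the dimension-reduction half of that pipeline, here is PCA via the SVD in a few lines of Python/NumPy (my own sketch; the data and names are made up):

    import numpy as np

    def pca_project(X, k=2):
        """Project N samples x D features onto the top-k principal components."""
        Xc = X - X.mean(axis=0)                  # center each feature
        _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
        return Xc @ Vt[:k].T                     # k-dimensional coordinates

    # e.g. squeeze 1000 observations in 50 dimensions down to a plottable scatter
    X = np.random.rand(1000, 50)
    print(pca_project(X, k=2).shape)             # (1000, 2)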

Applications
As computational technology becomes increasingly sophisticated, the skills required to operate it stretch further out of reach of non-specialists. Computer scientists can no longer simply throw generic tools over the wall for domain scientists to use in applications. We must build and deploy end-to-end applications as experiments, then extract the general techniques as they become apparent. The number and quality of application talks at this conference demonstrate that the eScience community has fully internalized this idea.

Notes

Excellent talk from George Djorgovski, an astronomer at Caltech.

Some highlights:

"All science will be eScience within a few years"

"Most data will never be seen by humans"

"Most data (and data constructs) are too complex to be comprehended by a human"

"Visualization is the bridge from quantitative information to our understanding."

"Data-driven science is not about data, it is about knowledge extraction"

"Computer science is the new mathematics"

"Teaching scientists and their students to think computationally"

--

Excel Tools for eScience on Microsoft CodePlex.
Joins and UnNest operators for spreadsheets.
Not just for eScience. Very useful.

http://www.codeplex.com/eScienceExcel
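
If you haven't met these operators outside a database, here is roughly what they do, sketched on list-of-dict "sheets" in Python (my own illustration, not the CodePlex tool's interface):

    stations = [{"station": "A", "depth_m": 5}, {"station": "B", "depth_m": 12}]
    obs = [{"station": "A", "readings": [1.1, 1.3]},
           {"station": "B", "readings": [2.4, 2.2]}]

    # Join: attach the matching station row to each observation row.
    depth = {s["station"]: s["depth_m"] for s in stations}
    joined = [dict(o, depth_m=depth[o["station"]]) for o in obs]

    # UnNest: flatten the list-valued cell into one row per element.
    flat = [dict(j, readings=r) for j in joined for r in j["readings"]]
    print(flat)   # four rows, one scalar reading each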

--
"On-the-fly environmental data visualization using wavelets"
Cyrus Shahabi, Kai Song and Farnoush Banaei-Kashani

Coining the cute term WOLAP (in reference to ROLAP and MOLAP) for wavelet online analytical processing.

Relevant to CMOP, as it provides a technique for efficiently answering range queries over timeseries data at varying degrees of resolution, which is exactly what we need to make the product factory more efficient.
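
To make that concrete, here is a bare-bones Haar-wavelet version in Python/NumPy (my own sketch of the general compress-then-query idea, not the authors' WOLAP algorithm): decompose, keep only the largest coefficients, and answer range sums approximately from the compressed representation.

    import numpy as np

    def haar_decompose(x):
        """Unnormalized Haar decomposition; len(x) must be a power of two."""
        levels = []
        while len(x) > 1:
            avg  = (x[0::2] + x[1::2]) / 2.0
            diff = (x[0::2] - x[1::2]) / 2.0
            levels.append(diff)        # detail coefficients at this scale
            x = avg
        levels.append(x)               # overall average
        return np.concatenate(levels)

    def haar_reconstruct(c, n):
        """Invert haar_decompose for a signal of length n."""
        sizes, m = [], n               # per-level coefficient counts: n/2, n/4, ..., 1
        while m > 1:
            m //= 2
            sizes.append(m)
        pieces, i = [], 0
        for s in sizes:
            pieces.append(c[i:i + s])
            i += s
        x = c[i:i + 1]                 # start from the overall average
        for diff in reversed(pieces):
            out = np.empty(2 * len(x))
            out[0::2] = x + diff
            out[1::2] = x - diff
            x = out
        return x

    signal = np.sin(np.linspace(0, 8 * np.pi, 256)) + 0.1 * np.random.randn(256)
    coeffs = haar_decompose(signal)
    small = coeffs.copy()
    small[np.argsort(np.abs(small))[:-16]] = 0.0     # keep only the 16 largest
    approx = haar_reconstruct(small, len(signal))
    print(signal[40:80].sum(), approx[40:80].sum())  # range sum: exact vs. approximate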

Previous work in this area:
Wavelets for data compression:
Agrawal CIKM 00
Garofalakis VLDB 00
Vitter SIGMOD 99

New work: compress data AND query
Schmidt PODS 02
Schmidt EDBT 02
Jahangiri SIGMOD 05

We should try this with both timeseries observations and perhaps with model results. Wavelet representations of unstructured grids seem to have a fair amount of support in the literature.

Seems to require uniform time intervals, but the author claims otherwise. Update: I spoke with Farnoush after the talk and managed to convince him that his technique requires O(N) time unless you assume uniform time intervals or a special (but straightforward) index structure that can tell you the number of records between the start and end of any range query.

So we can still make use of it, but we need to build and maintain the ordinal position of each record in the table in addition to the time. That way, when you ask for the records between t1 and t2, we know that there will be id(t2) - id(t1) records between them, so we can build the bitmap. Easy to implement in Postgres; just use a sequence to tag each record.
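
The lookup itself is just a binary search. A minimal sketch in Python, assuming the timestamps are kept sorted (the list index plays the role of the sequence-assigned id; names are my own):

    import bisect

    def record_count(times, t1, t2):
        """How many records fall in [t1, t2)? O(log N) via binary search."""
        return bisect.bisect_left(times, t2) - bisect.bisect_left(times, t1)

    times = [0.0, 0.7, 1.9, 2.0, 5.5, 9.3]   # non-uniform intervals are fine
    print(record_count(times, 1.0, 6.0))      # -> 3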

Seems promising!

--

more to come...