Database Systems: Advanced Topics and Implementation

Lecture: 9:30–10:50
Instructor Office Hours: 9:00 - 10:00, CSE 3238
TA Office Hours: 11:00 - 12:00, CSE 3232
You may team up with (at most) one partner for the class projects. Use the Piazza forum to advertise if you are looking for a partner.
Our class project is the construction of an XQuery processor. We consider a subset/modification of XML’s data model, XQuery, and XQuery’s type system as described in this note. The processor receives an XQuery, parses it into an abstract tree representation, optimizes it, and finally executes the optimized plan.
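The three stages (parse, optimize, execute) can be sketched as follows. This is only a toy illustration, not the language defined in the course note: the query syntax (slash-separated child steps, with "." meaning self), the class name `PipelineSketch`, and all method names are assumptions made for this example.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

// Hypothetical toy pipeline; the real project language is the one in the course note.
class PipelineSketch {
    /** Stage 1: parse "a/./b" into a list of step names (a stand-in for an abstract tree). */
    static List<String> parse(String query) {
        List<String> steps = new ArrayList<>();
        for (String s : query.split("/")) {
            if (!s.isEmpty()) steps.add(s);
        }
        return steps;
    }

    /** Stage 2: rewrite the plan. Here we only drop "." (self) steps, which never change the result. */
    static List<String> optimize(List<String> steps) {
        List<String> plan = new ArrayList<>();
        for (String s : steps) {
            if (!s.equals(".")) plan.add(s);
        }
        return plan;
    }

    /** Stage 3: execute the plan by following one child step at a time from the context node. */
    static List<Node> execute(List<String> plan, Node context) {
        List<Node> current = new ArrayList<>();
        current.add(context);
        for (String step : plan) {
            List<Node> next = new ArrayList<>();
            for (Node n : current) {
                if (step.equals(".")) { // self step: keep the node as is
                    next.add(n);
                    continue;
                }
                NodeList kids = n.getChildNodes();
                for (int i = 0; i < kids.getLength(); i++) {
                    Node kid = kids.item(i);
                    if (kid.getNodeType() == Node.ELEMENT_NODE && kid.getNodeName().equals(step)) {
                        next.add(kid);
                    }
                }
            }
            current = next;
        }
        return current;
    }

    /** Helper for the demo: builds a DOM document from an XML string. */
    static Node document(String xml) {
        try {
            return DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        } catch (Exception e) {
            throw new IllegalArgumentException("could not parse XML", e);
        }
    }

    public static void main(String[] args) {
        // Made-up sample data, not the course test file.
        Node doc = document("<PLAY><ACT><SCENE/><SCENE/></ACT><ACT><SCENE/></ACT></PLAY>");
        List<String> plan = optimize(parse("PLAY/./ACT/SCENE"));
        System.out.println(plan + " matches " + execute(plan, doc).size() + " nodes");
    }
}
```

Keeping the stages as separate functions means the optimizer can be tested by checking that `execute` returns the same nodes for the original and the rewritten plan.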
To access XML files you can use the standard DOM interface. There are a number of XML DOM parser implementations. The Java distribution includes one (see documentation here). As an alternative, the Xerces-J project from Apache is quite mature and stable. The W3C specification of DOM is here.
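As a minimal sketch of the DOM route, the following uses the parser bundled with the JDK (`javax.xml.parsers`) to load a document and walk the children of its root. The class name `DomSketch`, the helper names, and the sample XML string are assumptions made for this example.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;

// Hypothetical helper class for illustration only.
class DomSketch {
    /** Parses an XML string into a DOM Document using the JDK's built-in parser. */
    static Document parse(String xml) {
        try {
            DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
            DocumentBuilder builder = factory.newDocumentBuilder();
            return builder.parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalArgumentException("could not parse XML", e);
        }
    }

    /** Counts the element children of a node that carry the given tag name. */
    static int countChildren(Node parent, String tag) {
        int count = 0;
        NodeList kids = parent.getChildNodes();
        for (int i = 0; i < kids.getLength(); i++) {
            Node kid = kids.item(i);
            // Skip text, comment, and other non-element nodes.
            if (kid.getNodeType() == Node.ELEMENT_NODE && kid.getNodeName().equals(tag)) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        // Made-up sample data, not the course test file.
        Document doc = parse("<PLAY><TITLE>Sample</TITLE><ACT><SCENE/><SCENE/></ACT></PLAY>");
        Element root = doc.getDocumentElement();
        System.out.println(root.getNodeName() + " has " + countChildren(root, "ACT") + " ACT child(ren)");
    }
}
```

To read from disk instead, `DocumentBuilder.parse` also accepts a `java.io.File`.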
Test cases for project Phase I. For the data,
download Shakespeare's play, Julius Caesar, in XML
form (the associated DTD is here).
Queries can be found here.
Teams of two students each will give a 20-minute talk in class, presenting a research paper from the general field of XML-based data integration. Here are a few suggestions (you are welcome to propose others after consulting with me). Pick a slot here.
Time your presentation to last at most 20 minutes in total, with each team member covering about half of that time. Do not exceed this limit: we want to leave time for discussion, questions, and interruptions. Think of the presentations as catalysts for (hopefully intense and unruly) group discussions covering both the material and a critique of the presentation.
The project constitutes 80% of the final grade; the remaining 20% is earned through the presentation and class participation.
Formal XPath Semantics note
A brief informal XQuery tutorial (much briefer and more readable than the W3C standard below)
Textbook material (from the warmly recommended textbook "Web Data Management"):
The complete XQuery and XML Schema documentation (the
WWW Consortium standards):