Database Reading Group

Papers previously discussed (most recent first):


Friday, 09 June 2017 @ 10:00 AM

Title: Smart Personalized Routing For Smart Cities

ICDE 2017

Author(s): Abdeltawab M. Hendawi Aqeel Rustum Amr A. Ahmadain David Hazel Ankur Teredesai Dev Oliver Mohamed Ali John A. Stankovic

Available at: http://www.cs.virginia.edu/hendawi/materials/ICDE_2017_CameraReady_421.pdf

Discussion Leader: Dave Maier


Friday, 02 June 2017 @ 10:00 AM

Title: Fast Queries Over Heterogeneous Data Through Engine Customization

VLDB 2016

Author(s): Manos Karpathiotakis, Ioannis Alagiannis, Anastasia Ailamaki

Available at: http://www.vldb.org/pvldb/vol9/p972-karpathiotakis.pdf

Discussion Leader: Chris


Friday, 19 May 2017 @ 10:00 AM

Title: Efficient Processing of Window Functions in Analytical SQL Queries

VLDB 2015

Author(s): Viktor Leis, Alfons Kemper, Kan Kundhikanjana, and Thomas Neumann

Available at: http://www.vldb.org/pvldb/vol8/p1058-leis.pdf

Discussion Leader: Hong Quach


Friday, 12 May 2017 @ 10:00 AM

Title: Reactive Vega: A Streaming Dataflow Architecture for Declarative Interactive Visualization

IEEE 2016

Author(s): Arvind Satyanarayan, Ryan Russell, Jane Hoffswell, and Jeffrey Heer

Available at: http://ieeexplore.ieee.org.proxy.lib.pdx.edu/stamp/stamp.jsp?arnumber=7192704

Discussion Leader: Basem


Friday, 05 May 2017 @ 10:00 AM

Title: Twitter Heron: Stream Processing at Scale

SIGMOD '15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data

Author(s): Sanjeev Kulkarni, Nikunj Bhagat, Maosong Fu, Vikas Kedigehalli, Christopher Kellogg, Sailesh Mittal, Jignesh M. Patel*,Karthik Ramasamy, Siddarth Taneja

Available at: http://dl.acm.org/citation.cfm?id=2742788

Discussion Leader: Shreemoyee Sarkar


Friday, 28 April 2017 @ 10:00 AM

Title: Spatial Online Sampling and Aggregation

VLDB 2016

Author(s): Lu Wang, Robert Christensen, Feifei Li, Ke Yi

Available at: http://www.vldb.org/pvldb/vol9/p84-wang.pdf

Discussion Leader: Chris


Friday, 21 April 2017 @ 10:00 AM

Title: Studying the Wikipedia Hyperlink Graph for Relatedness and Disambiguation

arXiv 2015

Author(s): Eneko Agirre, Ander Barrena, Aitor Soroa

Available at: https://arxiv.org/pdf/1503.01655.pdf

Discussion Leader: Hisham


Friday, 14 April 2017 @ 10:00 AM

Title: SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning

CIDR 2017

Author(s): Tarek Elgamal, Shangyu Luo, Mattias Boehm, Alexandre V. Evfimievski, Shirish Tatikonda, Berthold Reinwald, Prithviraj Sen

Available at: http://cidrdb.org/cidr2017/papers/p3-elgamal-cidr17.pdf

Discussion Leader: Dave Maier


Friday, 17 March 2017 @ 10:00 AM

Title: SnappyData: A Unified Cluster for Streaming, Transactions and Interactice Analytics

CIDR 2017

Author(s): Barzan Mozafari, Jags Ramnarayan, Sudhir Menon, Yogesh Mahajan, Soubhik Chakraborty, Hemant Bhanawat, Kishor Bachhav

Available at: http://cidrdb.org/cidr2017/papers/p28-mozafari-cidr17.pdf

Discussion Leader: Dave Maier


Friday, 10 March 2017 @ 10:00 AM

Title: Reducing the storage overhead of main memory OLTP databases with hybrid indexes

SIGMOD’16, June 26-July 01, 2016, San Francisco, CA, USA

Author(s): Huanchen Zhang. David G. Andersen. Andrew Pavlo, Michael Kaminsky, Lin Ma, Rui Shen

Available at: https://dl.acm.org/citation.cfm?id=2915222

Discussion Leader: Shreemoyee Sarkar


Friday, 03 March 2017 @ 10:00 AM

Title: Facetedpedia: Dynamic Generation of Query-Dependent Faceted Interfaces for Wikipedia

World wide web 2010

Author(s): Chengkai Li, Ning Yan, Senjuti B. Roy, Lekhendro Lisham, Gautam Das

Available at: http://dl.acm.org/citation.cfm?id=1772757

Discussion Leader: Hisham Benotman


Friday, 24 February 2017 @ 10:00 AM

Title: The Myria Big Data Management and Analytics System and Cloud Services

CIDR 2017

Author(s): Jingjing Wang, Tobin Baker, Magda Balazinska, Daniel Halperin, Brandon Hayes, Bill Howe, Dylan Hutchinson, Shrainik Jain, Ryan Maas, Parmita Mehta, Dominik Moritz, Brandon Myers, Jennifer Ortiz, Dan Suciu, Andrew Whittaker, Shengliang Xu

Available at: http://cidrdb.org/cidr2017/papers/p37-wang-cidr17.pdf

Discussion Leader: Dave Maier


Friday, 17 February 2017 @ 10:00 AM

Title: LEOPARD: Lightweight Edge-Oriented Partitioning and Replication for Dynamic Graphs

VLDB 2016

Author(s): Jiewen Huang, Daniel Abadi

Available at: http://www.vldb.org/pvldb/vol9/p540-huang.pdf

Discussion Leader: Basem Elazzabi


Friday, 10 February 2017 @ 10:00 AM

Title: Self-driving database management systems

CIDR 2017

Author(s): Pavlo et al.

Available at: http://db.cs.cmu.edu/papers/2017/p42-pavlo-cidr17.pdf

Discussion Leader: Jeremiah Peschka


Friday, 03 February 2017 @ 10:00 AM

Title: SnappyData: A Unified Cluster for Streaming, Transactions and Interactice Analytics

CIDR 2017

Author(s): Barzan Mozafari, Jags Ramnarayan, Sudhir Menon, Yogesh Mahajan, Soubhik Chakraborty, Hemant Bhanawat, Kishor Bachhav

Available at: http://cidrdb.org/cidr2017/papers/p28-mozafari-cidr17.pdf

Discussion Leader: Dave Maier


Friday, 27 January 2017 @ 10:00 AM

Title: ULDBs: Databases with Uncertainty and Lineage

VLDB '06

Author(s): Omar Benjelloun, Anish Das Sarma, Alon Halevy, Jennifer Widom

Available at: http://dl.acm.org.proxy.lib.pdx.edu/citation.cfm?id=1164209&CFID=890942670&CFTOKEN=88409415

Discussion Leader: Chris Giossi


Friday, 02 December 2016 @ 10:00 AM

Title: Coordination Avoidance in Database Systems

VLDB 2015 - Proceedings of the VLDB Endowment, Vol. 8, No. 3

Author(s): Peter Bailis, Alan Fekete, Michael J. Franklin, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica

Available at: http://www.bailis.org/papers/ca-vldb2015.pdf

Discussion Leader: Jeremiah Peschka


Friday, 18 November 2016 @ 10:00 AM

Title: The Snowflake Elastic Data Warehouse

SIGMOD '16 Proceedings of the 2016 International Conference on Management of Data Pages 215-226 ACM

Author(s): Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Vadim Antonov, Artin Avanes, Jon Bock, Jonathan Claybaugh, Daniel Engovatov, Martin Hentschel, Jiansheng Huang, Allison W. Lee, Ashish Motivala, Abdul Q. Munir, Steven Pelley, Peter Povinec, Greg Rahn, S

Available at: http://dl.acm.org/citation.cfm?id=2903741

Discussion Leader: Shree


Friday, 04 November 2016 @ 10:00 AM

Title: Plenario: An Open Data Discovery and Exploration Platform for Urban Science

IEEE Data Engineering Bulletin, Dec 2014

Author(s): C. Cattlett et al.

Available at: http://sites.computer.org/debull/A14dec/p27.pdf

Discussion Leader: Dave


Friday, 28 October 2016 @ 10:00 AM

Title: Rough sets and intelligent data analysis

Information Sciences 147 (2002) 1–12

Author(s): Zdzisław Pawlak

Available at: http://bcpw.bg.pw.edu.pl/Content/1932/infSci2002.pdf

Discussion Leader: Basem


Friday, 21 October 2016 @ 10:00 AM

Title: Seven Databases in Seven Weeks

CMU Seminar series

Author(s): CMU Seminar series

Available at: http://db.cs.cmu.edu/seminar2014/

Discussion Leader: Dave


Friday, 14 October 2016 @ 10:00 AM

Title: Customized Random Walk for Generating Wikipedia Article Recommendations

Not published

Author(s): Jocelyn Hickcox and Chris Min

Available at: https://pdfs.semanticscholar.org/cf43/e1d4c94f85f5de4e36b9ab777595d0253b56.pdf

Discussion Leader: Hisham


Friday, 07 October 2016 @ 10:00 AM

Title: Photon: Fault-tolerant and Scalable Joining of Continuous Data Streams

SIGMOD 2013

Author(s): Rajagopal Ananthanarayanan, Venkatesh Basker, Sumit Das, Ashish Gupta, Haifeng Jiang, Tianhao Qiu, Alexey Reznichenko, Deomid Ryabkov, Manpreet Singh, Shivakumar Venkataraman

Available at: http://dl.acm.org/citation.cfm?id=2465272

Discussion Leader: Chris


Friday, 27 May 2016 @ 10:00 AM

Title: An Optimization Framework for Map-Reduce Queries

EDBT 2012

Author(s): Leonidas Fegaras, Chengkai Li, Upa Gupta

Available at: http://lambda.uta.edu/mrql.pdf

Discussion Leader: David Maier


Friday, 20 May 2016 @ 10:00 AM

Title: Assessing Learning Outcomes in Web Search: A Comparison of Tasks and Query Strategies

CHIIR 2016

Author(s): Kevyn Collins-Thompson, , Soo Young Rieh, , Carl C. Haynes, , Rohail Syed

Available at: http://dl.acm.org/authorize?N09409

Discussion Leader: Hisham Benotman


Friday, 06 May 2016 @ 10:00 AM

Title: Data Properties Using Abstraction to Enhance the Use of Data in Decision Making

RPE Talk

Author(s): Basem Elazzabi

Available at: Attached

Discussion Leader: Basem Elazzabi


Friday, 29 April 2016 @ 10:00 AM

Title: Equality Saturation: a New Approach to Optimization

POPL 2009

Author(s): Ross Tate Michael Stepp Zachary Tatlock Sorin Lerner

Available at: https://www.cs.cornell.edu/~ross/publications/eqsat/eqsat_tate_popl09.pdf

Discussion Leader: David Maier


Friday, 22 April 2016 @ 10:00 AM

Title: Can We Analyze Big Data Inside a DBMS?

DOLAP 2013

Author(s): Carlos Ordonez

Available at: "http://delivery.acm.org/10.1145/2520000/2513198/p85-ordonez.pdf?ip=131.252.200.53&id=2513198&acc=ACTIVE%20SERVICE&key=B63ACEF81C6334F5.CA8B0988038A4DF4.4D4702B0C3E38B35.4D4702B0C3E38B35&CFID=766571505&CFTOKEN=44707543&__acm__=1459531410_93a7a59985f51c8fd

Discussion Leader: Chris


Friday, 15 April 2016 @ 10:00 AM

Title: SociaLite: An Efficient Graph Query Language Based on Datalog

TKDE July-Aug 2015

Author(s): J. Seo ; Dept. of Comput. Sci., Stanford Univ., Stanford, CA, USA ; S. Guo ; M. S. Lam

Available at: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7045548

Discussion Leader: David Maier


Friday, 08 April 2016 @ 10:00 AM

Title: "A Rule-Based Citation System for Structured and Evolving Datasets"

IEEE Data Eng. 2010

Author(s): "Peter Buneman & Gianmaria Silvello "

Available at: https://www.researchgate.net/profile/Gianmaria_Silvello/publication/220283202_A_Rule-Based_Citation_System_for_Structured_and_Evolving_Datasets/links/09e414fc28de624332000000.pdf

Discussion Leader: Abdussalam Alawini


Friday, 04 March 2016 @ 10:00 AM

Title: Querying and Managing Provenance through User Views in Scientific Workflows

Data Engineering, 2008.

Author(s): Biton, O. Cohen-Boulakia, S. ; Davidson, S.B. ; Hara, C.S.

Available at: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=4497516&url=http%3A%2F%2Fieeexplore.ieee.org%2Fxpls%2Fabs_all.jsp%3Farnumber%3D4497516

Discussion Leader: Abdussalam Alawini


Friday, 26 February 2016 @ 10:00 AM

Title: The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing

VLDB 2015

Author(s): Tyler Akidau, Robert Bradshaw, Craig Chambers, Slava Chernyak, Rafael J. Fernandez-Moctezuma, Reuven Lax, Sam McVeety, Daniel Mills, Frances Perry, Eric Schmidt, Sam Whittle

Available at: http://www.vldb.org/pvldb/vol8/p1792-Akidau.pdf

Discussion Leader: David Maier


Friday, 19 February 2016 @ 10:00 AM

Title: Entity ranking in Wikipedia

SAC '08 Proceedings of the 2008 ACM symposium on Applied computing

Author(s): Anne-Marie Vercoustre, James A. Thom, Jovan Pehcevski

Available at: http://dl.acm.org/citation.cfm?id=1363943

Discussion Leader: Hisham Benotman


Friday, 12 February 2016 @ 10:00 AM

Title: Orleans: Distributed Virtual Actors for Programmability and Scalability

MSR-TR-2014-41

Author(s): Philip A. Bernstein, Sergey Bykov, Alan Geller, Gabriel Kliot, and Jorgen Thelin

Available at: http://research.microsoft.com/apps/pubs?id=210931

Discussion Leader: David Maier


Friday, 05 February 2016 @ 10:00 AM

Title: An Architecture for Compiling UDF-centric Workflows

VLDB 2015

Author(s): Andrew Crotty, Alex Galaktos, Kayhan Dursun, Tim Kraska, Carsten Binnig, Ugur Cetintemel, Stan Zdonik

Available at: http://www.vldb.org/pvldb/vol8/p1466-crotty.pdf

Discussion Leader: Chris


Friday, 22 January 2016 @ 10:00 AM

Title: Combining Dependent Annotations for Relational Algebra

ICDT 2012

Author(s): Egor V. Kostylev, Peter Buneman

Available at: http://dl.acm.org.proxy.lib.pdx.edu/citation.cfm?id=2274597&CFID=575437389&CFTOKEN=59604673

Discussion Leader: Basem Elazzabi


Friday, 04 December 2015 @ 10:00 AM

Title: Reducing Implicit Racial Preferences: I. A Comparative Investigation of 17 Interventions

Journal of Experimental Psychology

Author(s):

Available at: http://www.fas.harvard.edu/~mrbworks/articles/2014_Lai_JESPG.pdf

Discussion Leader: Lois Delcambre


Friday, 20 November 2015 @ 10:00 AM

Title: Supervised Meta-blocking

VLDB14

Author(s): George Papadakis, George Papastefanatos, Georgia Koutrika

Available at: http://www.vldb.org/pvldb/vol7/p1929-papadakis.pdf

Discussion Leader: Abdussalam Alawini


Friday, 13 November 2015 @ 10:00 AM

Title: Serving DBpedia with DOLCE – More than Just Adding a Cherry on Top

ISWC15

Author(s): Heiko Paulheim and Aldo Gangemi

Available at: http://www.heikopaulheim.com/docs/iswc2015.pdf

Discussion Leader: Scott Britell


Friday, 06 November 2015 @ 10:00 AM

Title: RPE Talk

Author(s): Hisham Benothman

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Hisham Benotman


Friday, 30 October 2015 @ 10:00 AM

Title: Musketeer: all for one, one for all in data processing systems

EuroSys 15

Author(s): Gog, Schwarzkopf, Crook, Grosvenor, Clement, Hand

Available at: http://dl.acm.org/citation.cfm?id=2741968

Discussion Leader: David Maier


Friday, 16 October 2015 @ 10:00 AM

Title: MillWheel: Fault-Tolerant Stream Processing at Internet Scale

VLDB 2015

Author(s): Tyler Akidau, Alex Balikov, Kaya Bekiroglu,Slava Chernyak, Josh Haberman, Reuven Lax,Sam McVeety, Daniel Mills, Paul Nordstrom,Sam Whittle

Available at: http://www.cs.cmu.edu/~pavlo/courses/fall2013/static/papers/p734-akidau.pdf

Discussion Leader: Christopher Giossi


Friday, 09 October 2015 @ 10:00 AM

Title: Data-centric iteration in dynamic workflows

Elsevier - Future Generation Computer Systems The International Journal of eScience

Author(s): Jonas Diasa, , Gabriel Guerraa, , Fernando Rochinhaa, , Alvaro L.G.A. Coutinhoa, , Patrick Valduriezb, , Marta Mattoso

Available at: http://www.sciencedirect.com.proxy.lib.pdx.edu/science/article/pii/S0167739X14002155

Discussion Leader: Basem Elazzabi


Thursday, 10 September 2015 @ 10:00 AM

Title: Data-centric iteration in dynamic workflows

Elsevier - Future Generation Computer Systems The International Journal of eScience

Author(s): Jonas Diasa, Gabriel Guerraa, Fernando Rochinhaa , Alvaro L.G.A. Coutinhoa, Patrick Valduriezb, Marta Mattoso

Available at: http://www.sciencedirect.com.proxy.lib.pdx.edu/science/article/pii/S0167739X14002155

Discussion Leader: Basem Elazzabi


Friday, 05 June 2015 @ 10:00 AM

Title: Sample-Driven Schema Mapping

SIGMOD ’12, May 20–24, 2012, Scottsda le, Arizona, USA

Author(s): Li Qian, Michael J. Cafarella, H. V. Jagadish

Available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.354.173&rep=rep1&type=pdf

Discussion Leader: Basem Elazzabi


Friday, 29 May 2015 @ 10:00 AM

Title: Content Knowledge for Teaching What Makes It Special?

Journal of Teacher Education Volume 59 Number 5 November/December 2008

Author(s): Deborah Loewenberg Ball, Mark Hoover Thames, and Geoffrey Phelps

Available at: http://jte.sagepub.com/content/59/5/389.full.pdf+html

Discussion Leader: Lois Delcambre


Friday, 22 May 2015 @ 10:00 AM

Title: Hongkong International Terminals Gains Elastic Capacity Using a Data-Intensive Decision-Support System

Interfaces 35(1), pp. 61–75, © 2005 INFORMS

Author(s): K. G. Murty, et al.

Available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.75.7805&rep=rep1&type=pdf

Discussion Leader: David Maier


Friday, 15 May 2015 @ 10:00 AM

Title: Schema-free SQL

SIGMOD’14, June 22–27, 2014, Snowbird, UT, USA

Author(s): Fei Li, Tianyin Pan, H. V. Jagadish

Available at: http://wwweb.eecs.umich.edu/db/files/SIGMOD14LFa.pdf

Discussion Leader: Basem Elazzabi


Friday, 08 May 2015 @ 10:00 AM

Title: Multiple Diagram Navigation MDN (RPE Talk)

Author(s): Hisham Benothman

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Hisham Benotman


Friday, 01 May 2015 @ 10:00 AM

Title: A semantic approach to data translation: A case study of environmental observations data

Knowledge-Based Systems 75 (2015) 104–123

Author(s): Yanfeng Shu , David Ratcliffe , Michael Compton , Geoffrey Squire , Kerry Taylor

Available at: Attached

Discussion Leader: Veronika Megler


Friday, 24 April 2015 @ 10:00 AM

Title: Making State Explicit for Imperative Big Data Processing

Visualization and Computer Graphics, IEEE Transactions on (Volume:20 , Issue: 12 )

Author(s): Raul Castro Fernandez, Matteo Migliavacca†, Evangelia Kalyvianaki , Peter Pietzuch

Available at: https://www.usenix.org/system/files/conference/atc14/atc14-paper-castro_fernandez.pdf

Discussion Leader: David Maier


Friday, 17 April 2015 @ 10:00 AM

Title: TBD

TBD

Author(s): TBD

Available at: TBD

Discussion Leader: TBD


Friday, 10 April 2015 @ 10:00 AM

Title: Predictive Interaction for Data Transformation

CIDR 2015

Author(s): Jeffery Heer, Joseph Hellerstein, and Sean Kandel

Available at: https://idl.cs.washington.edu/files/2015-PredictiveInteraction-CIDR.pdf

Discussion Leader: Abdussalam Alawini


Friday, 13 March 2015 @ 10:00 AM

Title: DATA STREAM WAREHOUSING IN TIDALRACE

CIDR 2015

Author(s): Theodore Johnson (AT&T Labs – Research); Vladislav Shkapenyuk (AT&T Labs – Research)

Available at: http://www.cidrdb.org/cidr2015/Papers/CIDR15_Paper4.pdf

Discussion Leader: David Maier


Friday, 06 March 2015 @ 10:00 AM

Title: Assigning Search Tasks Designed to Elicit Exploratory Search Behaviors

Proceedings of the Symposium on Human-Computer Interaction and Information Retrieval. ACM, 2012

Author(s): Barbara M. Wildemuth & Luanne Freund

Available at: http://ils.unc.edu/searchtasks/publication/publication_2.pdf

Discussion Leader: Hisham Benotman


Friday, 27 February 2015 @ 12:00 AM

Title: CROWDMAP: Crowdsourcing Ontology Alignment with Microtasks

ISWC 2012

Author(s): Cristina Sarasua,, Elena Simperl, and Natalya F. Noy

Available at: http://web.stanford.edu/~natalya/papers/iswc2012_crowdmap.pdf

Discussion Leader: Lois Delcambre


Friday, 20 February 2015 @ 10:00 AM

Title: DATAHUB: COLLABORATIVE DATA SCIENCE & DATASET VERSION MANAGEMENT AT SCALE

CIDR 2015

Author(s): Anant Bhardwaj (MIT);  Souvik Bhattacherjee (U. Maryland); Amit Chavan (U. Maryland); Amol Deshpande(U. Maryland); Aaron J. Elmore (MIT & U. Chicago); Samuel Madden (MIT); Aditya Parameswaran (MIT & U. Illinois)

Available at: http://www.cidrdb.org/cidr2015/Papers/CIDR15_Paper18.pdf

Discussion Leader: Basem Elazzabi


Friday, 13 February 2015 @ 10:00 AM

Title: Analyzing Schema.org

ISWC 2014

Author(s): Peter F. Patel-Schneider

Available at: https://github.com/lidingpku/iswc2014/blob/master/paper/87960257-analyzing-schemaorg.pdf?raw=true

Discussion Leader: Lois and Scott


Friday, 13 February 2015 @ 10:00 AM

Title: Deployment of RDFa, Microdata, and Microformats on the Web – A Quantitative Analysis

ISWC 2013

Author(s): Christian Bizer, Kai Eckert, Robert Meusel, Hannes Mühleisen, Michael Schuhmacher, andJohanna Völker

Available at: http://hannes.muehleisen.org/Bizer-etal-DeploymentRDFaMicrodataMicroformats-ISWC-InUse-2013.pdf

Discussion Leader: Lois and Scott


Friday, 06 February 2015 @ 10:00 AM

Title: TUPLEWARE: ''BIG'' DATA, BIG ANALYTICS, SMALL CLUSTERS

CIDR 2015

Author(s): Andrew Crotty (Brown University); Alex Galakatos (Brown University); Kayhan Dursun (Brown University); Tim Kraska (Brown University); Ugur Cetintemel (Brown  University); Stan Zdonik  (Brown University)

Available at: http://www.cidrdb.org/cidr2015/Papers/CIDR15_Paper23u.pdf

Discussion Leader: David Maier


Friday, 30 January 2015 @ 10:00 AM

Title: Biperpedia: An Ontology for Search Applications

2014 VLDB

Author(s): Rahul Guptay Alon Halevyy Xuezhi Wangx Steven Euijong Whangy Fei Wuy

Available at: http://static.googleusercontent.com/media/research.google.com/en/us/pubs/archive/41894.pdf

Discussion Leader: Veronika Megler


Friday, 23 January 2015 @ 10:00 AM

Title: Explass: Exploring Associations between Entities via Top-K Ontological Patterns and Facets

ISWC 2014

Author(s): Gong Cheng, Yanan Zhang and Yuzhong Qu

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Scott Britell


Friday, 16 January 2015 @ 10:00 AM

Title: Privacy-preserving record linkage using Bloom filters

BMC Medical Informatics and Decision Making 2009, 9:41

Author(s): Rainer Schnell, Tobias Bachteler and Jörg Reiher

Available at: http://www.biomedcentral.com/content/pdf/1472-6947-9-41.pdf

Discussion Leader: Abdussalam Alawini


Friday, 05 December 2014 @ 10:00 AM

Title: Detection, Simulation and Elimination of Semantic Anti-patterns in Ontology-Driven Conceptual Models

ER 2014

Author(s): Giancarlo Guizzardi, Tiago Prince Sales

Available at: http://www.inf.ufes.br/~gguizzardi/ER2014-CR++.pdf

Discussion Leader: Scott Britell


Friday, 21 November 2014 @ 10:00 AM

Title: Towards Integrating the Detection of Genetic Variants into an In-Memory Database

2014 IEEE International Conference on Big Data

Author(s): Cindy Fähnrich, Matthieu-P. Schapranow, Hasso Plattner

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Patrick Leyshock


Friday, 14 November 2014 @ 10:00 AM

Title: Uniform access to NoSQL systems

Inf. Syst. (IS) 43:117-133 (2014)

Author(s): Paolo Atzeni, Francesca Bugiotti, Luca Rossi

Available at: http://www.bugiotti.it/downloads/publications/sosIS13.pdf

Discussion Leader: Paolo Atzeni


Friday, 07 November 2014 @ 10:00 AM

Title: A runtime approach to model-generic translation of schema and data

Information Systems Volume 37 Issue 3, May, 2012

Author(s): Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, Fabrizio Celli, and Giorgio Gianforme.

Available at: http://www.dia.uniroma3.it/~atzeni/psfiles/IS2012.pdf

Discussion Leader: Lois Delcambre


Friday, 31 October 2014 @ 10:00 AM

Title: Kinds of contexts and their impact on semantic similarity measurement

Pervasive Computing and Communications 2008

Author(s): Krzysztof Janowicz

Available at: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4517435

Discussion Leader: Veronika Megler


Friday, 24 October 2014 @ 10:00 AM

Title: Exploring the Design Space of Composite Visualization

Pacific Visualization Symposium (PacificVis), 2012 IEEE

Author(s): Waqas Javed & Niklas Elmqvist

Available at: https://engineering.purdue.edu/~elm/projects/compvis/compvis.pdf

Discussion Leader: Hisham Benotman


Friday, 10 October 2014 @ 10:00 AM

Title: The Trill Incremental Analytics Engine

MSR-TR-2014-54

Author(s): Badrish Chandramouli, Jonathan Goldstein, Mike Barnett, Robert DeLine, Danyel Fisher, John C. Platt, James F. Terwilliger, and John Wernsing

Available at: http://research.microsoft.com/pubs/214609/trill-TR.pdf

Discussion Leader: David Maier


Friday, 06 June 2014 @ 10:00 AM

Title: Table extraction using conditional random fields

SIGIR '03 Proceedings

Author(s): David Pinto, Andrew McCallum, Xing Wei and W. Bruce Croft

Available at: http://dl.acm.org/ft_gateway.cfm?id=860479&ftid=212136&dwn=1&CFID=431341442&CFTOKEN=40719278

Discussion Leader: Done Hertel


Friday, 30 May 2014 @ 10:00 AM

Title: Shark: SQL and Rich Analytics at Scale

Proceedings of the 2013 international conference on Management of data.

Author(s): R. Xin, J. Rosen, M. Zaharia, M. Franklin, S. Shenker, I. Stoica

Available at: http://arxiv.org/pdf/1211.6176

Discussion Leader: Patrick Leyshock


Friday, 23 May 2014 @ 10:00 AM

Title: The Conceptual Model ≡ An Adequate and Dependable Artifact Enhanced by Concepts

Info Modeling & Knowledge Bases, IOS Press, 2014

Author(s): Bernhard Thalheim

Available at: TBA

Discussion Leader: Lois Delcambre


Friday, 09 May 2014 @ 10:00 AM

Title: Automatic Web Spreadsheet Data Extraction

VLDB Workshop on Semantic Search over the Web, Trento, Italy. 2013

Author(s): Zhe Chen, Michael Cafarella

Available at: http://web.eecs.umich.edu/~chenzhe/paper/shirley_vldbssw.pdf

Discussion Leader: Abdussalam Alawini


Friday, 02 May 2014 @ 10:00 AM

Title: The Bohemian Bookshelf: Supporting Serendipitous Book Discoveries through Information Visualization

CHI '12 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems 2012

Author(s): Alice Thudt, Uta Hinrichs and Sheelagh Carpendale

Available at: http://dl.acm.org/citation.cfm?id=2208607

Discussion Leader: Hisham Benotman


Friday, 25 April 2014 @ 10:00 AM

Title: Schema exchange: Generic mappings for transforming data and metadata

Data & Knowledge Engineering Volume 68 Issue 7, July, 2009

Author(s): Paolo Papotti and Riccardo Torlone

Available at: http://www.dia.uniroma3.it/~torlone/pubs/dke09.pdf

Discussion Leader: Scott Britell


Friday, 18 April 2014 @ 10:00 AM

Title: Towards the web of concepts: extracting concepts from large datasets

VLDB 2010

Author(s): Aditya Parameswaran, Hector Garcia-Molina, Anand Rajaraman

Available at: http://dl.acm.org/citation.cfm?id=1920914

Discussion Leader: Veronik Megler


Friday, 11 April 2014 @ 10:00 AM

Title: Data Curation at Scale: The Data Tamer System

CIDR 2013

Author(s): Michael Stonebraker (MIT); Daniel Bruckner (UC Berkeley); Ihab Ilyas (QCRI); George Beskales (QCRI); Mitch Cherniack (Brandeis University); Stan Zdonik (Brown University); Alexander Pagan (MIT); Shan Xu (Verisk Analytics)

Available at: http://www.cidrdb.org/cidr2013/Papers/CIDR13_Paper28.pdf

Discussion Leader: David Maier


Friday, 14 March 2014 @ 10:00 AM

Title: Scalable Anomaly Detection for Smart City Infrastructure Networks

Internet Computing, IEEE (Volume:17 , Issue: 6 )

Author(s): Difallah, Djellel Eddine Cudre-Mauroux, Philippe ; McKenna, Sean A.

Available at: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?reload=true&arnumber=6576747

Discussion Leader: Veronika Megler


Friday, 07 March 2014 @ 10:00 AM

Title: Do Graphical Search Interfaces Support Effective Search for and Evaluation of Digital Library Resources?

JCDL’11

Author(s): Kirsten R. Butcher, Sarah Davies, Ashley Crockett, Aaron Dewald, Robert Zheng

Available at: http://dl.acm.org/citation.cfm?id=1998136

Discussion Leader: Hisham Benotman


Friday, 28 February 2014 @ 10:00 AM

Title: Profiling, What-if Analysis, and Cost-Based Optimization of MapReduce Programs

VLDB 2011

Author(s): H. Herodotou & S. Babu

Available at: http://152.3.140.1/~hero/files/vldb11-job-optimization.pdf

Discussion Leader: Patrick Leyshock


Friday, 21 February 2014 @ 10:00 AM

Title: Scientific Data Management in the Coming Decade

Microsoft Research Tech. Report 2005

Author(s): Jim Gray, David T. Liu, Maria Nieto-Santisteban, Alexander S. Szalay, David DeWitt and Gerd Heber

Available at: http://arxiv.org/pdf/cs.DB/0502008.pdf

Discussion Leader: Abdussalam Alawini


Friday, 14 February 2014 @ 10:00 AM

Title: Visual Cluster Exploration of Web Clickstream Data

IEEE Symposium on Visual Analytics Science and Technology 2012

Author(s): Jishang Wei,et. al.

Available at: http://users.soe.ucsc.edu/~pang/visweek/2012/vast/papers/wei.pdf

Discussion Leader: Dona Hertel


Friday, 07 February 2014 @ 10:00 AM

Title: Rank and Relevance in Novelty and Diversity Metrics

RecSys’11

Author(s): Saúl Vargas and Pablo Castells

Available at: http://ir.ii.uam.es/pubs/recsys11-vargas.pdf

Discussion Leader: Jeremy Steinhauer


Friday, 31 January 2014 @ 10:00 AM

Title: NoizCrowd: A Crowd-Based Data Gathering and Management System for Noise Level Data

Mobile Web Information Systems Lecture Notes in Computer Science Volume 8093, 2013, pp 172-186

Author(s): Mariusz Wisniewski, Gianluca Demartini, Apostolos Malatras, Philippe Cudré-Mauroux

Available at: http://link.springer.com/chapter/10.1007%2F978-3-642-40276-0_14

Discussion Leader: David Maier


Friday, 24 January 2014 @ 10:00 AM

Title: Extending relational query optimization to dynamic schemas for information integration in multidatabases

2007 ACM SIGMOD

Author(s): Catharine M. Wyss and Felix I. Wyss.

Available at: http://dl.acm.org/citation.cfm?id=1247480.1247533

Discussion Leader: Scott Britell


Friday, 17 January 2014 @ 10:30 AM

Title: Reexamining the Cluster Hypothesis: Scatter/Gather on Retrieval Results

Proceedings of the Nineteenth Annual International ACM SIGIR Conference, Zurich, June 1996.

Author(s): Marti A. Hearst and Jan O. Pedersen

Available at: http://people.ischool.berkeley.edu/~hearst/papers/sg-sigir96/sigir96.html also in ACM digital library

Discussion Leader: Lois Delcambre


Friday, 17 January 2014 @ 10:00 AM

Title: The cluster hypothesis revisited

SIGIR 1985

Author(s): Voorhees E.

Available at: http://dl.acm.org/citation.cfm?id=253524

Discussion Leader: Lois Delcambre


Friday, 10 January 2014 @ 10:00 AM

Title: Paper Choosing Session

Author(s):

Available at: #

Discussion Leader:


Friday, 10 January 2014 @ 12:00 AM

Title: Paper Choosing Session

Author(s):

Available at: #

Discussion Leader:


Friday, 06 December 2013 @ 10:00 AM

Title: Rank and relevance in novelty and diversity metrics for recommender systems

Author(s): Saúl Vargas and Pablo Castells

Available at: http://ir.ii.uam.es/pubs/recsys11-vargas.pdf

Discussion Leader: Dona Hertel


Friday, 22 November 2013 @ 10:00 AM

Title: GenBase: A Benchmark for the Genomics Era

Author(s): Pradeep Dubey, Nadathur Satish, Narayanan Sundaram, Sam Madden, Mike Stonebraker, Rebecca Taft and Manasi Vartak

Available at: N/A

Discussion Leader: Patrick Leyshock


Friday, 15 November 2013 @ 10:00 AM

Title: DBpedia - A Crystallization Point for the Web of Data

Author(s): Christian Bizer , Jens Lehmann , Georgi Kobilarov, Soren Auer, Christian Becker, Richard Cyganiak, Sebastian Hellmann

Available at: http://w.websemanticsjournal.org/index.php/ps/article/viewFile/164/162

Discussion Leader: Hisham Benotman


Friday, 08 November 2013 @ 10:00 AM

Title: Tracking Trash

Author(s): Santi Phithakkitnukoon, Malima I. Wolf, Dietmar Offenhuber, David Lee, Assaf Biderman, Carlo Ratti

Available at: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=6504856

Discussion Leader: David Maier


Friday, 01 November 2013 @ 10:00 AM

Title: Tuning Large Scale Deduplication with Reduced Effort

Author(s): Guilherme Dal Bianco, Renata Galante,Carlos A. Heuser, and Marcos André Gonçalves

Available at: http://sacan.biomed.drexel.edu/cmteditor/program/download?confname=ssdbm2013&pid=80&file=paper

Discussion Leader: Abdussalam Alawini


Friday, 25 October 2013 @ 10:00 AM

Title: Item popularity and recommendation accuracy

Author(s): Harald Steck

Available at: http://dx.doi.org/10.1145/2043932.2043957

Discussion Leader: Jeremy Steinhauer


Friday, 18 October 2013 @ 10:00 AM

Title: From Personal Desktops to Personal Dataspaces: A Report on Building the iMeMex Personal Dataspace Management System

Author(s): Jens-Peter Dittrich Lukas Blunschi Markus Färber Olivier René Girard Shant Kirakos Karakashian Marcos Antonio Vaz Salles

Available at: http://infosys.cs.uni-saarland.de/publications/DBF+07.pdf

Discussion Leader: Veronika Megler


Friday, 11 October 2013 @ 10:00 AM

Title: Semi-Automatically Mapping Structured Sources into the Semantic Web

Author(s): Craig A. Knoblock, Pedro Szekely, José Luis Ambite, Aman Goel, Shubham Gupta, Kristina Lerman, Maria Muslea, Mohsen Taheriyan, Parag Mallick

Available at: http://www.isi.edu/integration/papers/knoblock12-eswc.pdf

Discussion Leader: Scott Britell


Friday, 28 June 2013 @ 10:00 AM

Title: Paper-choosing Session

Author(s):

Available at: #

Discussion Leader:


Friday, 07 June 2013 @ 10:00 AM

Title: Data cleaning: Problems and current approaches

IEEE Data Engineering Bulletin 2000

Author(s): Rahm, Erhard and Do, and Hong Hai

Available at: http://dc-pubs.dbs.uni-leipzig.de/files/Rahm2000DataCleaningProblemsand.pdf

Discussion Leader: Abdussalam Alawini


Friday, 31 May 2013 @ 10:00 AM

Title: Stability of Recommendation Algorithms

TOIS 12 vol 4

Author(s): Adomavicius and Zhang

Available at: http://dl.acm.org/citation.cfm?id=2382442

Discussion Leader: Jeremy Steinhauer


Friday, 10 May 2013 @ 10:00 AM

Title: Schema mediation in peer data management systems

ICDE 2003

Author(s): Halevy, A.Y.; Ives, Z.G.; Suciu, D.; Tatarinov, I.

Available at: http://repository.upenn.edu/cgi/viewcontent.cgi?article=1113&context=cis_papers

Discussion Leader: Veronika Megler


Friday, 03 May 2013 @ 10:00 AM

Title: A survey of query-by-humming similarity methods

PETRA 2012

Author(s): Kotsifakos, Alexios et. al.

Available at: http://vlm1.uta.edu/~akotsif/petra2012.pdf

Discussion Leader: Dona Hertel


Friday, 26 April 2013 @ 10:00 AM

Title: Identifying Relationships between Spreadsheets

RPE

Author(s): Abdussalam Alawini

Available at:

Discussion Leader: Abdussalam Alawini


Friday, 19 April 2013 @ 10:00 AM

Title: SociaLite: Datalog Extensions for Efficient Social Network Analysis

ICDE 2013

Author(s): Jiwon Seo (Stanford), Stephen Guo (Stanford), Monica Lam (Stanford)

Available at: http://mobisocial.stanford.edu/papers/icde13.pdf

Discussion Leader: Scott Britell


Friday, 12 April 2013 @ 10:00 AM

Title: Constructivism in Computer Science Education

SIGCSE 1998

Author(s): Mordechai Ben-Ari

Available at: http://dl.acm.org/citation.cfm?id=274790.274308

Discussion Leader: Lois Delcambre


Friday, 12 April 2013 @ 10:00 AM

Title: Why Minimal Guidance During Instruction Does Not Work; An Analysis of the Failure of Constructivist, Discvoery, Problem-Based, Experiential, and Inquiry-based Teaching

Educational Psychologist, 41:2, 75-86, 2006

Author(s): Paul A. Kirschner, John Sweller, and Richard E. Clark

Available at: http://www.tandfonline.com/doi/pdf/10.1207/s15326985ep4102_1

Discussion Leader: Lois Delcambre


Friday, 05 April 2013 @ 11:00 AM

Title: Paper-choosing Session

Author(s):

Available at: #

Discussion Leader:


Thursday, 14 March 2013 @ 2:00 PM

Title: The Clio project: managing heterogeneity

SIGMOD Record 2001

Author(s): Renée J. Miller, Mauricio A. Hernández, Laura M. Haas, Lingling Yan, C. T. Howard Ho, Ronald Fagin, and Lucian Popa

Available at: http://www.sigmod.org/publications/sigmod-record/0103/JP-Sys.pdf

Discussion Leader: Dona Hertel


Thursday, 28 February 2013 @ 2:00 PM

Title: Incorporating variability in user behavior into systems based evaluation

CIKM 2012

Author(s): Ben Carterette, Evangelos Kanoulas, and Emine Yilmaz

Available at: http://dl.acm.org/citation.cfm?id=2396782

Discussion Leader: Jeremy Steinhauer


Thursday, 14 February 2013 @ 2:00 PM

Title: Observation-Driven Geo-Ontology Engineering

Transactions in GIS 2012

Author(s): Krzysztof Janowicz

Available at: http://geog.ucsb.edu/~jano/ODOEfinaldraft.pdf

Discussion Leader: Scott Britell


Thursday, 07 February 2013 @ 2:00 PM

Title: Efficient classification across multiple database relations: a CrossMine approach

IEEE Transactions on Knowledge and Data Engineering 2006

Author(s): Yin, X.; Han, J.; Yang, J.; Yu, P.S.

Available at: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=1626232

Discussion Leader: Abdussalam Alawini


Thursday, 31 January 2013 @ 2:00 PM

Title: Validating Multi-column Schema Matchings by Type

ICDE 2008

Author(s): Bing Tian Dai; Koudas, N.; Srivastava, D.; Tung, A.K.H.; Venkatasubramanian, S.

Available at: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=4497420

Discussion Leader: Lois Delcambre


Thursday, 24 January 2013 @ 2:00 PM

Title: Matching unstructured product offers to structured product specifications

SIGKDD 2011

Author(s): Anitha Kannan, Inmar E. Givoni, Rakesh Agrawal, and Ariel Fuxman

Available at: http://dl.acm.org/citation.cfm?id=2020474

Discussion Leader: Veronika Megler


Thursday, 17 January 2013 @ 2:00 PM

Title: Automatic partitioning of database applications

VLDB 2012

Author(s): Alvin Cheung, Samuel Madden, Owen Arden, and Andrew C. Myers

Available at: http://vldb.org/pvldb/vol5/p1471_alvincheung_vldb2012.pdf

Discussion Leader: Patrick Leyshock


Friday, 11 January 2013 @ 10:00 AM

Title: Paper-choosing Session

Author(s):

Available at: #

Discussion Leader:


Friday, 30 November 2012 @ 10:00 AM

Title: Contextualized knowledge repositories for the Semantic Web

We propose Contextualized Knowledge Repository (CKR): an adaptation of the well studied theories of context for the Semantic Web. A CKR is composed of a set of OWL 2 knowledge bases, which are embedded in a context by a set of qualifying attributes (time, space, topic, etc.) specifying the boundaries within which the knowledge base is assumed to be true. Contexts of a CKR are organized by a hierarchical coverage relation, which enables an effective representation of knowledge and a flexible method for its reuse between the contexts. The paper defines the syntax and the semantics of CKR; shows that concept satisfiability and subsumption are decidable with the complexity upper bound of 2NExpTime, and it also provides a sound and complete natural deduction calculus that serves to characterize the propagation of knowledge between contexts.

Author(s): Luciano Serafini, Martin Homola

Available at: https://dkm.fbk.eu/images/1/1b/Jws-serafini-homola-ckr-2012.pdf

Discussion Leader: Lois Delcambre


Friday, 16 November 2012 @ 10:00 AM

Title: Spanner: Google's Globally-Distributed Database

Spanner is Google's scalable, multi-version, globally-distributed, and synchronously-replicated database. It is the first system to distribute data at global scale and support externally-consistent distributed transactions. This paper describes how Spanner is structured, its feature set, the rationale underlying various design decisions, and a novel time API that exposes clock uncertainty. This API and its implementation are critical to supporting external consistency and a variety of powerful features: non-blocking reads in the past, lock-free read-only transactions, and atomic schema changes, across all of Spanner.

Author(s): James C. Corbett, Jeffrey Dean, Michael Epstein, Andrew Fikes, Christopher Frost, JJ Furman, Sanjay Ghemawat, Andrey Gubarev, Christopher Heiser, Peter Hochschild, Wilson Hsieh, Sebastian Kanthak, Eugene Kogan, Hongyi Li, Alexander Lloyd, Sergey Melnik, D

Available at: http://research.google.com/archive/spanner.html

Discussion Leader: Bryon Nevis


Friday, 09 November 2012 @ 10:00 AM

Title: Multidimensional Integrated Ontologies: A Framework for Designing Semantic Data Warehouses

The Semantic Web enables companies and organizations to gather huge amounts of valuable semantically annotated data concerning their subjects of interest. Nowadays, many applications attach metadata and semantic annotations taken from domain and application ontologies to the information they generate. From our point of view, the concepts in these ontologies could describe the facts, dimensions, categories and values implied in the analysis subjects of a data warehouse. In this paper we propose the Semantic Data Warehouse to be a repository of ontologies and semantically annotated data resources. We also propose an ontology-driven framework to design multidimensional analysis models for Semantic Data Warehouses. This framework provides means for building an integrated ontology, called the Multidimensional Integrated Ontology (MIO), including the classes, relationships and instances that represent interesting analysis dimensions and measures. The reasoning capabilities of a MIO can be used to check the properties required by current multidimensional databases (e.g., dimension orthogonality, category satisfiability, etc.). In this paper we also sketch how the instance data of a MIO can be translated into OLAP cubes for analysis purposes. Finally, some implementation issues of the overall framework are discussed. Keywords: Data warehouses, Semantic Web, Multi-ontology integration 1.

Author(s): Victoria Nebot, Rafael Berlanga, Juan Manuel Pérez, María José Aramburu und Torben Bach Pedersen

Available at: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.167.7367

Discussion Leader: ChristopheSchuetz


Friday, 02 November 2012 @ 10:00 AM

Title: Time-based calibration of effectiveness measures

Many current effectiveness measures incorporate simplifying assumptions about user behavior. These assumptions prevent the measures from reflecting aspects of the search process that directly impact the quality of retrieval results as experienced by the user. In particular, these measures implicitly model users as working down a list of retrieval results, spending equal time assessing each document. In reality, even a careful user, intending to identify as much relevant material as possible, must spend longer on some documents than on others. Aspects such as document length, duplicates and summaries all influence the time required. In this paper, we introduce a time-biased gain measure, which explicitly accommodates such aspects of the search process. By conducting an appropriate user study, we calibrate and validate the measure against the TREC 2005 Robust Track test collection. We examine properties of the measure, contrasting it to traditional effectiveness measures, and exploring its extension to other aspects and environments. As its primary benefit, the measure allows us to evaluate system performance in human terms, while maintaining the simplicity and repeatability of system-oriented tests. Overall, we aim to achieve a clearer connection between user-oriented studies and system-oriented tests, allowing us to better transfer insights and outcomes from one to the other.

Author(s): Mark Smucker, Charles Clarke

Available at: http://dl.acm.org/citation.cfm?id=2348300

Discussion Leader: Jeremy Steinhauer


Friday, 26 October 2012 @ 10:00 AM

Title: Human-Powered Sorts and Joins

Crowdsourcing markets like Amazon’s Mechanical Turk (MTurk) make it possible to task people with small jobs, such as labeling images or looking up phone numbers, via a programmatic interface. MTurk tasks for processing datasets with humans are currently designed with significant reimplementation of common workflows and ad-hoc selection of parameters such as price to pay per task. We describe how we have integrated crowds into a declarative workflow engine called Qurk to reduce the burden on workflow designers. In this paper, we focus on how to use humans to compare items for sorting and joining data, two of the most common operations in DBMSs. We describe our basic query interface and the user interface of the tasks we post to MTurk. We also propose a number of optimizations, including task batching, replacing pairwise comparisons with numerical ratings, and pre-filtering tables before joining them, which dramatically reduce the overall cost of running sorts and joins on the crowd. In an experiment joining two sets of images, we reduce the overall cost from $67 in a naive implementation to about $3, without substantially affecting accuracy or latency. In an end-to-end experiment, we reduced cost by a factor of 14:5.

Author(s): Adam Marcus, Eugene Wu, David Karger, Samuel Madden, Robert Miller

Available at: http://dl.acm.org/citation.cfm?id=2047487

Discussion Leader: Scott Britell


Friday, 19 October 2012 @ 10:00 AM

Title: SheetDiff: A Tool for Identifying Changes in Spreadsheets

2010 IEEE Symposium on Visual Languages and Human-Centric Computing

Author(s): Chambers, C., Erwig, M., & Luckey, M.

Available at: http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=5635197&tag=1

Discussion Leader: Abdussalam Alawini


Friday, 28 September 2012 @ 10:00 AM

Title: Paper-choosing Session

Author(s):

Available at: #

Discussion Leader:


Friday, 31 August 2012 @ 10:00 AM

Title: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals

Data Mining and Knowledge Discovery 1997

Author(s): Jim Gray, Surajit Chaudhuri, Adam Bosworth, Andrew Layman, Don Reichart, Murali Venkatrao, Frank Pellow, and Hamid Pirahesh

Available at: http://www.springerlink.com/content/l105x6337j100052/fulltext.pdf

Discussion Leader: Veronika Megler


Friday, 24 August 2012 @ 10:00 AM

Title: OPTICS: ordering points to identify the clustering structure

SIGMOD 1999

Author(s): Mihael Ankerst, Markus M. Breunig, Hans-Peter Kriegel, and Jörg Sander

Available at: http://www.dbs.informatik.uni-muenchen.de/~breunig/HomepageResearch/Papers/OPTICS.pdf

Discussion Leader: Jeremy Steinhauer


Friday, 17 August 2012 @ 10:00 AM

Title: Change patterns and change support features – Enhancing flexibility in process-aware information systems

DKE 2008

Author(s): Barbara Weber, Manfred Reichert, Stefanie Rinderle-Ma

Available at: http://www.business.unr.edu/faculty/kuechler/788/processAwareInfoSys.pdf

Discussion Leader: Christoph Schuetz


Friday, 10 August 2012 @ 10:00 AM

Title: A Semantic Approach to Discovering Schema Mapping Expressions

ICDE 2007

Author(s): An, Y.,Borgida, A.,Miller, R.J. and Mylopoulos, J.

Available at: http://www.cs.toronto.edu/~miller/papers/ABMM07.pdf

Discussion Leader: Lois Delcambre


Friday, 03 August 2012 @ 10:00 AM

Title: NoDB: efficient query execution on raw data files

SIGMOD 2012

Author(s): Ioannis Alagiannis, Renata Borovica, Miguel Branco, Stratos Idreos, and Anastasia Ailamaki

Available at: http://infoscience.epfl.ch/record/175803/files/NoDBsigmod2012.pdf

Discussion Leader: Scott Britell


Friday, 20 July 2012 @ 10:00 AM

Title: The design of the force.com multitenant internet application development platform

SIGMOD 2009

Author(s): Craig D. Weissman and Steve Bobrowski

Available at: http://cloud.pubs.dbs.uni-leipzig.de/sites/cloud.pubs.dbs.uni-leipzig.de/files/p889-weissman-1.pdf

Discussion Leader: Bryon Nevis


Friday, 13 July 2012 @ 10:00 AM

Title: Fuzzy querying of incomplete, imprecise, and heterogeneously structured data in the relational model using ontologies and rules

IEEE Transactions on Fuzzy Systems, Volume 13, Issue 3

Author(s): Buche, P., Dervin, C., Haemmerle, O., Thomopoulos, R.

Available at: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?tp=&arnumber=1439523

Discussion Leader: Abdussalam Alawini


Friday, 06 July 2012 @ 10:00 AM

Title: Temporal Analytics on Big Data for Web Advertising

ICDE 2012

Author(s): Badrish Chandramouli, Jonathan Goldstein, and Songyun Duan

Available at: http://research.microsoft.com/pubs/155806/timr-icde2012.pdf

Discussion Leader: David Maier


Friday, 29 June 2012 @ 10:00 AM

Title: Paper-choosing Session

Author(s):

Available at: #

Discussion Leader:


Friday, 08 June 2012 @ 10:00 AM

Title: Techniques for Efficiently Querying Scientific Workflow Provenance Graphs

EDBT 2010

Author(s): Manish Kumar Anand, Shawn Bowers, Bertram Lud?scher

Available at: http://www.cs.gonzaga.edu/~bowers/papers/edbt-2010.pdf

Discussion Leader: Lois Delcambre


Friday, 01 June 2012 @ 10:00 AM

Title: Searching with Numbers

WWW 2002

Author(s): Rakesh Agrawal and Ramakrishnan Srikant

Available at: http://rakesh.agrawal-family.com/papers/www02sbn.pdf

Discussion Leader: Veronika Megler


Friday, 25 May 2012 @ 10:00 AM

Title: Storing Matrices on Disk: Theory and Practice Revisited

VLDB 2011

Author(s): Yi Zhang, Kamesh Munagala, and Jun Yang

Available at: http://www.vldb.org/pvldb/vol4/p1075-zhang.pdf

Discussion Leader: Patrick Leyshock


Friday, 18 May 2012 @ 10:00 AM

Title: Uniform Access to Non-Relational Database Systems: the SOS Platform

CAiSE 2012

Author(s): Atzeni, Bugiotti, Rossi

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Michael Grossniklaus


Friday, 11 May 2012 @ 10:00 AM

Title: No reading group this week - RPE presentations in FAB 86-01

Author(s):

Available at: #

Discussion Leader:


Friday, 04 May 2012 @ 10:00 AM

Title: OLAP query reformulation in peer-to-peer data warehousing

Information Systems, Volume 37, Issue 5, July 2012

Author(s): M. Golfarelli, F. Mandreoli, W. Penzo, S. Rizzi, E. Turricchia

Available at: http://www.sciencedirect.com/science/article/pii/S0306437911000822

Discussion Leader: Christoph Schuetz


Friday, 27 April 2012 @ 10:00 AM

Title: Graph Pattern Matching: A Join/Semijoin Approach

TKDE Volume: 23 Issue:7 2011

Author(s): Jiefeng Cheng, Jeffrey Xu Yu, and Philip S. Yu

Available at: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5582090

Discussion Leader: Dave Maier


Friday, 20 April 2012 @ 10:00 AM

Title: Recovering Semantics of Tables on the Web

VLDB 2011

Author(s): Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Pa?ca, Warren Shen, Fei Wu, Gengxin Miao, and Chung Wu

Available at: http://static.googleusercontent.com/external_content/untrusted_dlcp/research.google.com/en/us/pubs/archive/37232.pdf

Discussion Leader: Alon Halevy


Friday, 13 April 2012 @ 10:00 AM

Title: No reading group this week - Faculty Candidate Talk in FAB 86-01

Author(s):

Available at: http://www.pdx.edu/computer-science/charles-wright

Discussion Leader:


Friday, 06 April 2012 @ 10:00 AM

Title: Paper Choosing Session - Talk by Christoph Schuetz

Author(s):

Available at: #

Discussion Leader:


Friday, 23 March 2012 @ 10:00 AM

Title: An empirical characterization of stream programs and its implications for language and compiler design

PACT 2010

Author(s): William Thies and Saman Amarasinghe

Available at: http://groups.csail.mit.edu/commit/papers/2010/thies-pact10.pdf

Discussion Leader: Kristin Tufte


Friday, 16 March 2012 @ 10:00 AM

Title: From Spreadsheets to Relational Databases and Back

2009 ACM SIGPLAN workshop on Partial evaluation and program manipulation

Author(s): Jacome Cunha, Joao Saraiva, and Joost Visser

Available at: http://dl.acm.org/citation.cfm?id=1480972

Discussion Leader: Abdussalam Alawini


Friday, 09 March 2012 @ 10:00 AM

Title: Model-driven Development of Context-Aware Web Applications

TOIT Volume 7, Number 1, February 2007

Author(s): Stefano Ceri, Florian Daniel, Maristella Matera, Federico M. Facca

Available at: http://www.floriandaniel.it/university/ops/download.php?oid=18

Discussion Leader: Scott Britell


Friday, 02 March 2012 @ 10:00 AM

Title: Understanding Queries in a Search Database System

PODS 2010

Author(s): Ronald Fagin , Benny Kimelfeld , Yunyao Li , Sriram Raghavan

Available at: http://www.almaden.ibm.com/cs/people/fagin/pods10.pdf

Discussion Leader: Veronika Megler


Friday, 24 February 2012 @ 10:00 AM

Title: Fast and accurate estimation of shortest paths in large graphs

CIKM 2010

Author(s): Andrey Gubichev, Srikanta Bedathur, Stephan Seufert, and Gerhard Weikum

Available at: http://www.mpi-inf.mpg.de/~sseufert/papers/aspsn-cikm.pdf

Discussion Leader: Dave Maier


Friday, 17 February 2012 @ 10:00 AM

Title: Understanding digital library adoption: a use diffusion approach

JCDL 2011

Author(s): Keith E. Maull, Manuel Gerardo Saldivar, and Tamara Sumner

Available at: http://www.cs.colorado.edu/~lizb/phd/sumner1.pdf

Discussion Leader: Jeremy Steinhauer


Friday, 10 February 2012 @ 10:00 AM

Title: Scalable SPARQL Querying of Large RDF Graphs

VLDB 2011

Author(s): Jiewen Huang, Daniel J. Abadi, and Kun Ren

Available at: http://www.vldb.org/pvldb/vol4/p1123-huang.pdf

Discussion Leader: Michael Grossniklaus


Friday, 03 February 2012 @ 10:00 AM

Title: Databases will Visualize Queries too

VLDB 2011

Author(s): W. Gatterbauer

Available at: http://www.andrew.cmu.edu/user/gatt/download/vldb2011_Database_Query_Visualization.pdf

Discussion Leader: Len Shapiro


Friday, 03 February 2012 @ 10:00 AM

Title: Using Data for Systemic Financial Risk Management

CIDR 2011

Author(s): Mark Flood, HV Jagadish, Albert Kyle, Frank Olken, and Louiqa Raschid

Available at: http://www.cidrdb.org/cidr2011/Papers/CIDR11_Paper16.pdf

Discussion Leader: Len Shapiro


Friday, 03 February 2012 @ 10:00 AM

Title: Computational Journalism: A Call to Arms to Database Researchers

CIDR 2011

Author(s): Sarah Cohen, Chengkai Li, Jun Yang, and Cong Yu

Available at: http://www.cidrdb.org/cidr2011/Papers/CIDR11_Paper17.pdf

Discussion Leader: Len Shapiro


Friday, 27 January 2012 @ 10:00 AM

Title: Ricardo: integrating R and Hadoop

SIGMOD 2010

Author(s): Sudipto Das, Yannis Sismanis, Kevin S. Beyer, Rainer Gemulla, Peter J. Haas, and John McPherson

Available at: http://citeseerx.ist.psu.edu/viewdoc/download;jsessionid=15C748C4FB9FA6A622CCB68422389003?doi=10.1.1.186.750&rep=rep1&type=pdf

Discussion Leader: Patrick Leyshock


Friday, 20 January 2012 @ 10:00 AM

Title: Towards a pattern science for the Semantic Web

Semantic Web Volume 1, Number 1-2 / 2010

Author(s): Aldo Gangemi and Valentina Presutti

Available at: http://iospress.metapress.com/content/6038458357588v43/fulltext.pdf

Discussion Leader: Lois Delcambre


Friday, 09 December 2011 @ 10:00 AM

Title: A Parameterized Representation of Uncertain Conceptual Spaces

Transactions in GIS, 2004

Author(s): Ola Ahlqvist

Available at: http://onlinelibrary.wiley.com/doi/10.1111/j.1467-9671.2004.00198.x/abstract

Discussion Leader: Veronika Megler


Friday, 02 December 2011 @ 10:00 AM

Title: Discovering HERM

Author(s): Bernhard Thalheim

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre


Friday, 18 November 2011 @ 10:00 AM

Title: Variable Length Compression for Bitmap Indices

DEXA 2011

Author(s): Fabian Corrales, David Chiu and Jason Sawin

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: David Chiu


Friday, 04 November 2011 @ 10:00 AM

Title: Synthesizing Products for Online Catalogs

VLDB 2011

Author(s): Hoa Nguyen, Ariel Fuxman, Stelios Paparizos, Juliana Freire, Rakesh Agrawal

Available at: http://www.vldb.org/pvldb/vol4/p409-nguyen.pdf

Discussion Leader: Kristin Tufte


Friday, 28 October 2011 @ 10:00 AM

Title: Find it if you can: a game for modeling different types of web search success using interaction data

SIGIR 2011

Author(s): Mikhail Ageev, Qi Guo, Dmitry Lagun, and Eugene Agichtein

Available at: http://dl.acm.org/citation.cfm?id=2009965

Discussion Leader: Jeremy Steinhauer


Friday, 21 October 2011 @ 10:00 AM

Title: Entity-relationship queries over wikipedia

SMUC 2010

Author(s): Xiaonan Li, Chengkai Li, and Cong Yu

Available at: http://dl.acm.org/citation.cfm?id=1871991

Discussion Leader: Scott Britell


Friday, 14 October 2011 @ 10:00 AM

Title: Pregel: a system for large-scale graph processing

SIGMOD 2010

Author(s): Grzegorz Malewicz, Matthew H. Austern, Aart J.C Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski

Available at: http://dl.acm.org/citation.cfm?id=1807184

Discussion Leader: Dave Maier


Friday, 07 October 2011 @ 10:00 AM

Title: Bridging Two Worlds with RICE

VLDB 2011

Author(s): Philipp Grosse, Wolfgang Lehner, Thomas Weichert, Franz Farber, Wen-Syan

Available at: http://www.vldb.org/pvldb/vol4/p1307-grosse.pdf

Discussion Leader: Patrick Leyshock


Friday, 30 September 2011 @ 10:00 AM

Title: Paper-choosing session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader:


Friday, 02 September 2011 @ 10:00 AM

Title: Model-independent schema and data translation

A runtime approach to model-generic translation of schema and data is proposed. It is based on our previous work on MIDST, a platform conceived to perform translations in an off-line fashion. In the original approach, the source database is imported into a dictionary, where it is stored according to a universal model. Then, the translation is applied within the tool as a composition of elementary transformation steps, specified as Datalog programs. Finally, the result is exported into the operational system. Here we illustrate a new, lightweight approach where the database is not imported. The tool needs only to know the model and the schema of the source database and generates views on the operational system that transform the underlying data (stored in the source schema) according to the corresponding schema in the target model. Views are generated in an almost automatic way, on the basis of the Datalog rules for schema translation. Current work on extensions of the approach to the family of the so called noSQL systems will also be sketched.

Author(s):

Available at: http://www.dia.uniroma3.it/~atzeni/psfiles/MIDSTtoAppear.pdf

Discussion Leader: Paolo Atzeni


Friday, 26 August 2011 @ 10:00 AM

Title: Learning in Query Optimization

Database Systems let users specify queries in a declarative language like SQL. Most modern DBMS optimizers rely upon a cost model to choose the best query execution plan (QEP) for any given query. Cost estimates are heavily dependent upon the optimizers estimates for the number of rows that will result at each step of the QEP for complex queries involving many predicates and/or operations. These estimates, in turn, rely upon statistics on the database and modeling assumptions that may or may not be true for a given database. In my talk, I will present an overview of the research on learning in query optimization. I will introduce the concept of a LEarning Optimizer (LEO) as a comprehensive way to repair incorrect statistics and cardinality estimates of a query execution plan. By monitoring executed queries, LEO compares the optimizers estimates with actuals at each step in a QEP, and computes adjustments to cost estimates and statistics that may be used during the current and future query optimizations. LEO introduces a feedback loop to query optimization that enhances the available information on the database where the most queries have occurred, allowing the optimizer to actually learn from its past mistakes. In the second part of the talk, I describe how the knowledge gleaned by LEO is exploited consistently in a query optimizer, by adjusting the optimizers model and by maximizing information entropy. In the third part of the talk, I will briefly sketch my current research work and vision on Information Management in the Cloud in the Stratosphere (massively parallel and distributed processing) and MIA (information marketplace) projects.

Author(s):

Available at: http://portal.acm.org/ft_gateway.cfm?id=1142586&type=pdf

Discussion Leader: Volker Markl


Friday, 19 August 2011 @ 10:00 AM

Title: Semantic Stream Query Optimization Exploiting Dynamic Metadata

ICDE 2011

Author(s): Luping Ding, Karen Works, Elke A. Rundensteiner

Available at: http://davis.wpi.edu/dsrg/PROJECTS/QM/documents/Herald.pdf

Discussion Leader: David Maier


Friday, 12 August 2011 @ 10:00 AM

Title: Automatic schema merging using mapping constraints among incomplete sources

CIKM 2010

Author(s): Xiang Li, Christoph Quix, David Kensche, and Sandra Geisler

Available at: http://portal.acm.org/ft_gateway.cfm?id=1871479&type=pdf

Discussion Leader: Scott Britell


Friday, 05 August 2011 @ 10:00 AM

Title: Design and Implementation of Verifiable Audit Trails for a Versioning File System

USENIX FAST 2007

Author(s): Zachary N. J. Peterson, Randal Burns, Giuseppe Ateniese, and Stephen Bono

Available at: http://www.usenix.org/events/fast07/tech/full_papers/peterson/peterson.pdf

Discussion Leader: David Archer


Friday, 29 July 2011 @ 10:00 AM

Title: MonetDB/SQL Meets SkyServer: The Challenges of a Scientific Database

SSDBM 2007

Author(s): M. Ivanova, N. Nes, R. Gonclaves, and M. Kersten

Available at: http://www.rgi-otb.nl/geoinfoned/documents/Ivanova2007.pdf

Discussion Leader: Patrick Leyshock


Friday, 15 July 2011 @ 10:00 AM

Title: Improving Recommender Systems by Incorporating Social Contextual Information

ACM TOIS Vol. 29, No. 2, 2011

Author(s): Hao Ma, Tom Chao Zhou, Michael R. Lyu, Irwin King

Available at: http://portal.acm.org/ft_gateway.cfm?id=1961212&type=pdf

Discussion Leader: Jeremy Steinhauer


Friday, 08 July 2011 @ 10:00 AM

Title: A Generic Database Schema for CIDOC-CRM Data Management

ADBIS 2011

Author(s): Kai Jannaschk, Claas Anders Rathje, Bernhard Thalheim, and Frank Förster

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre


Friday, 01 July 2011 @ 10:00 AM

Title: A method to map heterogeneity between near but non-equivalent semantic attributes in multiple health data registries

Health Informatics Journal Vol. 14 No. 1 2008

Author(s): Nadine Schuurman and Agnieszka Leszczynski

Available at: http://www.sfu.ca/gis/schuurman/cv/PDF/27-45%20JHI_086333%20Shuurman.pdf

Discussion Leader: Veronika Megler


Friday, 24 June 2011 @ 10:00 AM

Title: Paper-choosing session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone


Friday, 17 June 2011 @ 10:00 AM

Title: A Position Paper on Data Sovereignty: The Importance of Geolocating Data in the Cloud

USENIX HotCloud 11

Author(s): Zachary N.J. Peterson, Mark Gondree, and Robert Beverly

Available at: http://znjp.com/papers/peterson-hotcloud11.pdf

Discussion Leader: Zachary Peterson


Friday, 03 June 2011 @ 10:00 AM

Title: Hybrid Merge/Overlap Execution Technique for Parallel Array Processing

EDBT/ICDT Array Databases Workshop 2011

Author(s): Emad Soroush and Magdalena Balazinska

Available at: http://www.edbt.org/Proceedings/2011-Uppsala/papers/workshops/arraydb_workshop/a3-soroush.pdf

Discussion Leader: Patrick Leyshock


Friday, 20 May 2011 @ 10:00 AM

Title: Social media recommendation based on people and tags

SIGIR 2010

Author(s): I Guy, N Zwerdling, I Ronen, D Carmel, and E Uziel

Available at: http://portal.acm.org/ft_gateway.cfm?id=1835484&type=pdf&CFID=16120849&CFTOKEN=48728937

Discussion Leader: Jeremy Steinhauer


Friday, 06 May 2011 @ 10:00 AM

Title: C-MR: A Continuous-MapReduce Processing Model for Low-Latency Stream Processing on Multi-Core Architectures

Technical Report CS-10-01, Brown University, Feb. 2010

Author(s): N. Backman, K. Pattabiraman, U. Cetintemel

Available at: ftp://ftp.cs.brown.edu/pub/techreports/10/cs10-01.pdf

Discussion Leader: Michael Grossniklaus


Friday, 29 April 2011 @ 10:00 AM

Title: Towards Practical Incremental Recomputation for Scientists: An Implementation for the Python Language

TaPP 2010

Author(s): Philip J. Guo and Dawson Engler

Available at: http://www.usenix.org/event/tapp10/tech/full_papers/guo.pdf

Discussion Leader: David Archer


Friday, 22 April 2011 @ 10:00 AM

Title: How Soccer Players Would Do Stream Joins

SIGMOD 2011

Author(s): Jens Teubner and Rene Mueller

Available at: http://people.inf.ethz.ch/jteubner/publications/soccer-players/soccer-players.pdf

Discussion Leader: Kristin Tufte


Friday, 15 April 2011 @ 10:00 AM

Title: A Generalized Join Algorithm

BTW 2011

Author(s): Goetz Graefe

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Len Shapiro


Friday, 08 April 2011 @ 10:00 AM

Title: A co-Relational Model of Data for Large Shared Data Banks

ACM QUEUE 2011

Author(s): Erik Meijer and Gavin Bierman

Available at: http://portal.acm.org/ft_gateway.cfm?id=1961297&type=pdf

Discussion Leader: David Maier


Friday, 18 March 2011 @ 10:00 AM

Title: Evolution and Future Directions of Large Scale Storage and Computation Systems at Google

SoCC 2010

Author(s): Jeffrey Dean

Available at: http://tinyurl.com/28l9mtt

Discussion Leader: Len Shapiro


Friday, 11 March 2011 @ 10:00 AM

Title: Indexing Multi-dimensional Data in a Cloud System

SIGMOD 2010

Author(s): Jinbao Wang, Sai Wu, Hong Gao, Jianzhong Li, and Beng Chin Ooi

Available at: http://www.comp.nus.edu.sg/~ooibc/sigmod10rtcan.pdf

Discussion Leader: Travis Hall


Friday, 04 March 2011 @ 10:00 AM

Title: Scalable Data Integration by Mapping Data to Queries

Technical Report 633, Dept. of Computer Science, ETH Zurich, July 2009

Author(s): Martin Hentschel, Donald Kossman, Daniela Florescu, Laura Haas, Tim Kraska, and Renee J. Miller

Available at: ftp://ftp.inf.ethz.ch/pub/publications/tech-reports/6xx/633.pdf

Discussion Leader: Scott Britell


Friday, 25 February 2011 @ 10:00 AM

Title: Incorporating Partitioning and Parallel Plans into the SCOPE Optimizer

ICDE 2010

Author(s): Jingren Zhou, Per-Ake Larson, and Ronnie Chaiken

Available at: http://research.microsoft.com/en-us/um/people/jrzhou/pub/pgs.pdf

Discussion Leader: Michael Grossniklaus


Friday, 18 February 2011 @ 10:00 AM

Title: In Support of Mesodata in Database Management Systems

LNCS vol. 3180, 2004

Author(s): Denise de Vries, Sally Rice, and John F. Roddick

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Veronika Megler


Friday, 11 February 2011 @ 10:00 AM

Title: An Architecture for Recycling Intermediates in a Column-store

SIGMOD 2009

Author(s): Milena G. Ivanova, Martin L. Kersten, Niels J. Nes, and Romulo A.P. Goncalves

Available at: http://oai.cwi.nl/oai/asset/14287/14287A.pdf

Discussion Leader: Patrick Leyshock


Friday, 04 February 2011 @ 10:00 AM

Title: Secure kNN Computation on Encrypted Databases

SIGMOD 2009

Author(s): Wai Kit Wong, David Wai-lok Cheung, Ben Kao, and Nikos Mamoulis

Available at: http://portal.acm.org/citation.cfm?id=1559862

Discussion Leader: Farhana Kabir


Friday, 28 January 2011 @ 10:00 AM

Title: Consistency Analysis in Bloom: a CALM and Collected Approach

CIDR 2011

Author(s): Peter Alvaro, Neil Conway, Joseph M. Hellerstein, and William R. Marczak

Available at: http://neilconway.org/docs/bloom_calm_cidr11.pdf

Discussion Leader: David Maier


Friday, 21 January 2011 @ 10:00 AM

Title: G-Store: A Scalable Data Store for Transactional Multi key Access in the Cloud

SoCC 2010

Author(s): Sudipto Das, Divyakant Agrawal, and Amr El Abbadi

Available at: http://portal.acm.org/citation.cfm?id=1807157

Discussion Leader: David Chiu


Friday, 14 January 2011 @ 10:00 AM

Title: Analyzing the Energy Efficiency of a Database Server

SIGMOD 2010

Author(s): Dimitris Tsirogiannis, Stavros Harizopoulos, and Mehul A. Shah

Available at: http://nms.csail.mit.edu/~stavros/pubs/energy_sigmod10.pdf

Discussion Leader: Kristen Tufte


Friday, 10 December 2010 @ 10:00 AM

Title: APEX: An Adaptive Path Index for XML Data

SIGMOD 2002

Author(s): Chin-Wan Chung, Jun-Ki Min, and Kyuseok Shim

Available at: http://dx.doi.org/10.1145/564691.564706

Discussion Leader: David Archer


Friday, 03 December 2010 @ 10:00 AM

Title: Horizontally Scalable Data Stores

Author(s): Rick Cattell

Available at: http://www.cattell.net/datastores/Datastores.pdf

Discussion Leader: Len Shapiro


Friday, 19 November 2010 @ 10:00 AM

Title: Large-scale Incremental Processing Using Distributed Transactions and Notifications

OSDI 2010

Author(s): Daniel Peng and Frank Dabek

Available at: http://www.usenix.org/events/osdi10/tech/full_papers/Peng.pdf

Discussion Leader: Scott Britell


Friday, 12 November 2010 @ 10:00 AM

Title: Continuous Subgraph Pattern Search over Certain and Uncertain Graph Streams

TKDE 22:8, August 2010

Author(s): Lei Chen and Changliang Wang

Available at: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=5453378

Discussion Leader: Jeremy Steinhauer


Friday, 05 November 2010 @ 10:00 AM

Title: Capturing the Uncertainty of Moving-Object Representations

SSD 1999

Author(s): Dieter Pfoser and Christian S. Jensen

Available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.17.4724&rep=rep1&type=pdf

Discussion Leader: Veronika Megler


Friday, 29 October 2010 @ 10:00 AM

Title: SECRET: A Model for Analysis of the Execution Semantics of Stream Processing Systems

VLDB 2010

Author(s): Irina Botan, Roozbeh Derakhshan, Nihal Dindar, Laura Haas, Renee J. Miller, and Nesime Tatbul

Available at: http://www.vldb2010.org/proceedings/files/papers/R20.pdf

Discussion Leader: David Maier


Friday, 22 October 2010 @ 3:00 PM

Title: UPI: A Primary Index for Uncertain Databases

VLDB 2010

Author(s): Hideaki Kimura, Samuel Madden, and Stanley B. Zdonik

Available at: http://db.csail.mit.edu/pubs/upi-cr.pdf

Discussion Leader: Patrick Leyshock


Friday, 15 October 2010 @ 10:00 AM

Title: Manimal: Relational Optimization for Data-Intensive Programs

WebDB 2010

Author(s): Michael J. Cafarella and Christopher Re

Available at: http://pages.cs.wisc.edu/~chrisre/papers/WebDB-Manimal.pdf

Discussion Leader: Nick Rayner


Friday, 08 October 2010 @ 10:00 AM

Title: CORADD: Correlation Aware Database Designer for Materialized Views and Indexes

VLDB 2010

Author(s): Hideaki Kimura, George Huo, Alexander Rasin, Samuel Madden, and Stanley B. Zdonik

Available at: http://www.vldb2010.org/proceedings/files/papers/R98.pdf

Discussion Leader: Lois Delcambre


Friday, 03 September 2010 @ 10:00 AM

Title: Efficient Pattern Matching over Event Streams

SIGMOD 2008

Author(s): Jagrati Agrawal, Yanlei Diao, Daniel Gyllstrom, and Neil Immerman

Available at: http://www.cs.umass.edu/~yanlei/publications/sase-sigmod08.pdf

Discussion Leader: Amit Bhat


Friday, 27 August 2010 @ 10:00 AM

Title: Provenance-Based Refresh in Data-Oriented Workflows

Stanford InfoLab Technical Report, July 2010

Author(s): Robert Ikeda, Semih Salihoglu, and Jennifer Widom

Available at: http://ilpubs.stanford.edu:8090/962/

Discussion Leader: David Archer


Friday, 20 August 2010 @ 10:00 AM

Title: Online Aggregation

SIGMOD 1997

Author(s): Joseph M. Hellerstein, Peter J. Haas, and Helen J. Wang

Available at: http://doi.acm.org/10.1145/253260.253291

Discussion Leader: Len Shapiro


Friday, 13 August 2010 @ 10:00 AM

Title: An Extensible Test Framework for the Microsoft StreamInsight Query Processor

DBTest 2010

Author(s): Alex Raizman, Asvin Ananthanarayan, Anton Kirilov, Badrish Chandramouli, and Mohamed Ali

Available at: http://research.microsoft.com/pubs/132100/Testing%20StreamInsight.pdf

Discussion Leader: Rafael J. Fernandez-Moctezuma


Friday, 06 August 2010 @ 10:00 AM

Title: F-Logic: A Higher-Order Language for Reasoning About Objects, Inheritance, and Scheme

SIGMOD 1989

Author(s): Michael Kifer and Georg Lausen

Available at: http://doi.acm.org/10.1145/67544.66939

Discussion Leader: David Maier


Friday, 30 July 2010 @ 10:00 AM

Title: Towards Adaptive, Flexible, and Self-tuned Database Systems

EPFL EDIC Research Proposal, July 2010

Author(s): Ioannis Alagiannis

Available at: http://wiki.epfl.ch/edicpublic/documents/Candidacy%20exam/candidacy_exam_alagiannis.pdf

Discussion Leader: Dan Colish


Friday, 23 July 2010 @ 10:00 AM

Title: Querying in Highly Mobile Distributed Environments

VLDB 1992

Author(s): Tomasz Imielinski and B. R. Badrinath

Available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.91.7422&rep=rep1&type=pdf

Discussion Leader: Veronika Megler


Friday, 16 July 2010 @ 10:00 AM

Title: A Transactional Model for Long-Running Activities

VLDB 1991

Author(s): Umeshwar Dayal, Meichun Hsu, and Rivka Ladin

Available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.68.7445&rep=rep1&type=pdf

Discussion Leader: Patrick Leyshock


Friday, 09 July 2010 @ 10:00 AM

Title: BIRCH: An Efficient Data Clustering Method for Very Large Databases

SIGMOD Record, June 1996

Author(s): Tian Zhang, Raghu Ramakrishnan, and Miron Livny

Available at: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.93.2882&rep=rep1&type=pdf

Discussion Leader: Jeremy Steinhauer


Friday, 02 July 2010 @ 10:00 AM

Title: Making Web Annotations Persistent over Time

JCDL 2010

Author(s): Robert Sanderson and Herbert Van de Sompel

Available at: http://arxiv.org/pdf/1003.2643

Discussion Leader: Lois Delcambre


Friday, 11 June 2010 @ 10:00 AM

Title: Transformation of Continuous Aggregation Join Queries over Data Streams

SSTD 2007

Author(s): Tri Minh Tran and Byung Suk Lee

Available at: http://portal.acm.org/citation.cfm?id=1784462.1784481

Discussion Leader: Rafael J. Fernandez-Moctezuma


Friday, 04 June 2010 @ 10:00 AM

Title: A Statistical Comparison of Tag and Query Logs

SIGIR 2009

Author(s): Mark J. Carman, Mark Baillie, Robert Gwadera, and Fabio Crestani

Available at: http://doi.acm.org/10.1145/1571941.1571965

Discussion Leader: Jeremy Steinhauer


Friday, 28 May 2010 @ 10:00 AM

Title: CodeQuest: Scalable Source Code Queries with Datalog

ECOOP 2006

Author(s): Elnar Hajiyev, Mathieu Verbaere, and Oege de Moor

Available at: http://www.springerlink.com/content/p438874736342611/

Discussion Leader: Nick Rayner


Friday, 21 May 2010 @ 10:00 AM

Title: JouleSort: A Balanced Energy-Efficient Benchmark

SIGMOD 2007

Author(s): Suzanne Rivoire, Mehul Shah, Parthasarathy Ranganathan, and Christos Kozyrakis

Available at: http://www.hpl.hp.com/personal/Mehul_Shah/papers/sigmod_2007_rivoire.pdf

Discussion Leader: Len Shapiro


Friday, 30 April 2010 @ 10:00 AM

Title: A Graph Model of Data and Workflow Provenance

TAPP 2010

Author(s): Umut Acar, Peter Buneman, James Cheney, Jan Van den Bussche, Natalia Kwasnikowska, and Stijn Vansummeren

Available at: http://www.usenix.org/event/tapp10/tech/full_papers/buneman.pdf

Discussion Leader: David Archer


Friday, 23 April 2010 @ 10:00 AM

Title: Composition and Inversion of Schema Mappings

SIGMOD Record, September 2009

Author(s): Marcelo Arenas, Jorge Perez, Juan Reutter, and Cristian Riveros

Available at: http://www.sigmod.org/publications/sigmod-record/0909/p17.principles.arenas.pdf/view

Discussion Leader: Dan Colish


Friday, 16 April 2010 @ 10:00 AM

Title: Exploiting Predicate-Window Semantics over Data Streams

SIGMOD Record, March 2006

Author(s): Thanaa M. Ghanem, Walid G. Aref, and Ahmed K. Elmagarmid

Available at: http://doi.acm.org/10.1145/1121995.1121996

Discussion Leader: Amit Bhat


Friday, 09 April 2010 @ 10:00 AM

Title: A General Datalog-Based Framework for Tractable Query Answering over Ontologies

PODS 2009

Author(s): Andrea Cali, Georg Gottlob, and Thomas Lukasiewicz

Available at: http://doi.acm.org/10.1145/1559795.1559809

Discussion Leader: David Maier


Friday, 19 March 2010 @ 10:00 AM

Title: USHER: Improving Data Quality with Dynamic Forms

ICDE 2010

Author(s): Kuang Chen, Harr Chen, Neil Conway, Joseph M. Hellerstein, and Tapan S. Parikh

Available at: http://neilconway.org/docs/icde2010_usher.pdf

Discussion Leader: Rafael J. Fernandez-Moctezuma


Friday, 12 March 2010 @ 10:00 AM

Title: Semantic Representation of Context Models: a Framework for Analyzing and Understanding

CIAO 2009

Author(s): Salma Najar, Oumaima Saidani, Manuele Kirsch-Pinheiro, Carine Souveyet, and Selmen Nurcan

Available at: http://doi.acm.org/10.1145/1552262.1552268

Discussion Leader: Nick Rayner


Friday, 05 March 2010 @ 10:00 AM

Title: A Conceptual View on Trajectories

Data & Knowledge Engineering 65:126-46, 2008

Author(s): S. Spaccapietra, C. Parent, M. Damiani, J. Macedo, F. Porta, and C. Vangenot

Available at: http://dx.doi.org/10.1016/j.datak.2007.10.008

Discussion Leader: Veronika Megler


Friday, 26 February 2010 @ 10:00 AM

Title: Object Reuse & Exchange: A Resource-Centric Approach

The Computing Research Repository April 2008

Author(s): Carl Lagoze, Herbert Van de Sompel, Michael L. Nelson, Simeon Warner, Robert Sanderson, and Pete Johnston

Available at: http://arxiv.org/abs/0804.2273

Discussion Leader: Scott Britell


Friday, 19 February 2010 @ 10:00 AM

Title: Declarative Support for Sensor Data Cleaning

International Conference on Pervasive Computing 2006

Author(s): Shawn R. Jeffery, Gustavo Alonso, Michael J. Franklin, Wei Hong, and Jennifer Widom

Available at: http://www.springerlink.com/content/4w5185l381nu48r1/

Discussion Leader: Len Shapiro


Friday, 12 February 2010 @ 10:00 AM

Title: A Framework for Semantic Link Discovery over Relational Data

CIKM 2009

Author(s): Oktie Hassanzadeh, Anastasios Kementsietsidis, Lipyeow Lim, Renee J. Miller, and Min Wang

Available at: http://doi.acm.org/10.1145/1645953.1646084

Discussion Leader: Dan Colish


Friday, 05 February 2010 @ 10:00 AM

Title: Evolving Objects in Temporal Information Systems

Annals of Mathematics and Artificial Intelligence, June 2007

Author(s): Alessandro Artale, Christine Parent, and Stefano Spaccapietra

Available at: http://www.springerlink.com/content/j05m127q3610g602/

Discussion Leader: David Archer


Friday, 29 January 2010 @ 10:00 AM

Title: A Session Based Personalized Search Using an Ontological User Profile

SAC 2009

Author(s): Mariam Daoud, Lynda Tamine-Lechani, Mohand Boughanem, and Bilal Chebaro

Available at: http://doi.acm.org/10.1145/1529282.1529670

Discussion Leader: Jeremy Steinhauer


Friday, 22 January 2010 @ 10:00 AM

Title: Anchor Modeling

ER 2009

Author(s): O. Regardt, L. Ronnback, M. Bergholtz, P. Johannesson, and P. Wohed

Available at: http://syslab.dsv.su.se/profiles/blogs/anchor-modeling

Discussion Leader: Lois Delcambre


Friday, 15 January 2010 @ 10:00 AM

Title: Impact of Disk Corruption on Open-Source DBMS

ICDE 2010

Author(s): S. Subramanian, Y. Zhang, R. Vaiyanathan, H. S. Gunawi, A. C. Arpaci-Dusseau, R. H. Arpaci-Dusseau, and J. F. Naugton

Available at: http://www.cs.wisc.edu/wind/Publications/corrupt-mysql-icde10.pdf

Discussion Leader: David Maier


Friday, 11 December 2009 @ 10:00 AM

Title: Characteristic Relational Patterns

KDD '09

Author(s): Arne Koopman and Arno Siebes

Available at: http://doi.acm.org/10.1145/1557019.1557071

Discussion Leader: Nick Rayner


Friday, 04 December 2009 @ 10:00 AM

Title: Understanding the Semantics of Data Provenance to Support Active Conceptual Modeling

Author(s): Sudha Ram and Jun Liu

Available at: http://kartik.eller.arizona.edu/ACML_Provenance_final.pdf

Discussion Leader: David Archer


Friday, 20 November 2009 @ 10:00 AM

Title: Typed Datalog

PADL '09

Author(s): David Zook, Emir Pasalic, and Beata Sarna-Starosta

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: David Maier


Friday, 13 November 2009 @ 10:00 AM

Title: A Comparison of Approaches to Large-Scale Data Analysis

SIGMOD '09

Author(s): Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel J. Abadi, David J. DeWitt, Samuel Madden, and Michael Stonebraker

Available at: http://doi.acm.org/10.1145/1559845.1559865

Discussion Leader: Veronika Megler


Friday, 06 November 2009 @ 11:30 AM

Title: Information Scraps: How and Why Information Eludes our Personal Information Management Tools

TOIS 26:4, September 2008

Author(s): Michael Bernstein, Max Van Kleek, David Karger, and M. C. Schraefel

Available at: http://doi.acm.org/10.1145/1402256.1402263

Discussion Leader: Amit Bhat


Friday, 30 October 2009 @ 10:00 AM

Title: Stream Warehousing with DataDepot

SIGMOD '09

Author(s): Lukasz Golab, Theodore Johnson, J. Spencer Seidel, and Vladislav Shkapenyuk

Available at: http://doi.acm.org/10.1145/1559845.1559934

Discussion Leader: Rafael J. Fernandez-Moctezuma


Friday, 23 October 2009 @ 10:00 AM

Title: Declarative Support for Sensor Data Cleaning

ICPC '06

Author(s): Shawn R. Jeffery, Gustavo Alonso, Michael J. Franklin, Wei Hong, and Jennifer Widom

Available at: http://www.springerlink.com/content/4w5185l381nu48r1/

Discussion Leader: Len Shapiro


Friday, 16 October 2009 @ 10:00 AM

Title: Semantics and Implementation of Continuous Sliding Window Queries Over Data Streams

TODS 34:1, April 2009

Author(s): Jurgen Kramer and Bernhard Seeger

Available at: http://doi.acm.org/10.1145/1508857.1508861

Discussion Leader: David Maier


Friday, 09 October 2009 @ 10:00 AM

Title: Reasonable Tag-based Collaborative Filtering for Social Tagging Systems

WICOW '08

Author(s): Reyn Y. Nakamoto, Shinsuke Nakajima, Jun Miyazaki, Shunsuke Uemura, Hirokazu Kato, and Youichi Inagaki

Available at: http://doi.acm.org/10.1145/1458527.1458533

Discussion Leader: Jeremy Steinhauer


Friday, 11 September 2009 @ 10:00 AM

Title: Prefilter: Predicate Pushdown at Streaming Speeds

SSPS '08

Author(s): Lukasz Golab, Theodore Johnson, and Oliver Spatscheck

Available at: http://doi.acm.org/10.1145/1379272.1379280

Discussion Leader: Amit Bhat


Friday, 21 August 2009 @ 10:00 AM

Title: Adaptive Control of Extreme-Scale Stream Processing Systems

CDCS '06

Author(s): Lisa Amini, Navendu Jain, Anshul Sehgal, Jeremy Silber, and Olivier Verscheure

Available at: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=1648858&isnumber=34569

Discussion Leader: Veronika Megler


Friday, 07 August 2009 @ 10:00 AM

Title: GDM: A New Graph Based Data Model Using Functional Abstractionx [sic]

J. Comput. Sci. & Technol., May 2006

Author(s): Sankhayan Choudhury, Nabendu Chaki, and Swapan Bhattacharya

Available at: http://dx.doi.org/10.1007/s11390-006-0430-0

Discussion Leader: David Archer


Friday, 29 May 2009 @ 10:00 AM

Title: Automatic Verification of Database-Driven Systems: A New Frontier

ICDT 2009

Author(s): Victor Vianu

Available at: http://doi.acm.org/10.1145/1514894.1514896

Discussion Leader: Nick Rayner


Friday, 22 May 2009 @ 10:00 AM

Title: Combating Spam in Tagging Systems: An Evaluation

ACM Transactions on the Web 2(3), October 2008

Author(s): Georgia Koutrika, Frans Adjie Effendi, Zoltan Gyongyi, Paul Heymann, and Hector Garcia-Molina

Available at: http://doi.acm.org/10.1145/1409220.1409225

Discussion Leader: Jeremy Steinhauer


Friday, 08 May 2009 @ 10:00 AM

Title: Containment of Conjunctive Queries on Annotated Relations

ICDT 2009

Author(s): Todd J. Green

Available at: http://db.cis.upenn.edu/DL/09/icdt2009_containment.pdf

Discussion Leader: David Archer


Friday, 01 May 2009 @ 10:00 AM

Title: Scalable Regular Expression Matching on Data Streams

SIGMOD 08

Author(s): Anirban Majumder, Rajeev Rastogi, and Sriram Vanama

Available at: http://doi.acm.org/10.1145/1376616.1376635

Discussion Leader: Rafael J. Fernandez-Moctezuma


Friday, 24 April 2009 @ 10:00 AM

Title: Recursive Computation of Regions and Connectivity in Networks

ICDE 2009

Author(s): Mengmeng Liu, Nicholas Taylor, Wenchao Zhou, Zachary Ives, and Boon Thau Loo

Available at: http://www.seas.upenn.edu/~boonloo/papers/maintenance_icde09.pdf

Discussion Leader: David Maier


Friday, 17 April 2009 @ 10:00 AM

Title: Transformation-based Framework for Record Matching

ICDE 2008

Author(s): Arvind Arasu, Surajit Chaudhuri, and Raghav Kaushik

Available at: http://research.microsoft.com/pubs/76150/icde08.pdf

Discussion Leader: Len Shapiro


Friday, 10 April 2009 @ 11:00 AM

Title: Fedora: An Architecture for Complex Objects and their Relationships

International Journal on Digital Libraries 6(2), April 2006

Author(s): Carl Lagoze, Sandy Payette, Edwin Shin, and Chris Wilper

Available at: http://www.springerlink.com/content/x7224797g8703g30/?p=1be63b93c3464b0491b3fea1f7b90005&pi=2

Discussion Leader: Lois Delcambre


Friday, 10 April 2009 @ 10:00 AM

Title: NCORE: Architecture and Implementation of a Flexible, Collaborative Digital Library

JCDL 08

Author(s): Dean B. Krafft, Aaron Birkland, and Ellen J. Cramer

Available at: http://doi.acm.org/10.1145/1378889.1378943

Discussion Leader: Lois Delcambre


Friday, 13 March 2009 @ 10:00 AM

Title: SCADS: Scale-Independent Storage for Social Computing Applications

CIDR 2009

Author(s): Michael Armbrust, Armando Fox, David Patterson, Nick Lanham, Beth Trushkowsky, Jesse Trutna, and Haruki Oh

Available at: http://www-db.cs.wisc.edu/cidr/cidr2009/Paper_86.pdf

Discussion Leader: Len Shapiro


Friday, 06 March 2009 @ 10:00 AM

Title: A SQL Database System for Solving Constraints

PIKM 08

Author(s): Sebastien Siva and Lesi Wang

Available at: http://doi.acm.org/10.1145/1458550.1458552

Discussion Leader: Nick Rayner


Friday, 27 February 2009 @ 10:00 AM

Title: ViP: A User-Centric View-Based Annotation Framework for Scientific Data

SSDBM 2008

Author(s): Qinglan Li, Alexandros Labrinidis, and Panos K. Chrysanthis

Available at: http://dx.doi.org/10.1007/978-3-540-69497-7_20

Discussion Leader: David Archer


Friday, 20 February 2009 @ 11:00 AM

Title: Eventually Consistent

Communications of the ACM, January 2009

Author(s): Werner Vogels

Available at: http://doi.acm.org/10.1145/1435417.1435432

Discussion Leader: Rafael J. Fernandez-Moctezuma


Friday, 20 February 2009 @ 10:00 AM

Title: Google's Deep Web Crawl

VLDB 2008

Author(s): Jayant Madhavan, David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, and Alon Halevy

Available at: http://doi.acm.org/10.1145/1454159.1454163

Discussion Leader: Len Shapiro


Friday, 13 February 2009 @ 10:00 AM

Title: RIOT: I/O-Efficient Numerical Computing without SQL

CIDR 2009

Author(s): Yi Zhang, Herodotos Herodotou, and Jun Yang

Available at: http://www-db.cs.wisc.edu/cidr/cidr2009/Paper_43.pdf

Discussion Leader: David Maier


Friday, 06 February 2009 @ 10:00 AM

Title: A Unified and Discriminative Model for Query Refinement

SIGIR 08

Author(s): Jiafeng Guo, Gu Xu, Hang Li, and Xueqi Cheng

Available at: http://doi.acm.org/10.1145/1390334.1390400

Discussion Leader: Nick Rayner


Friday, 30 January 2009 @ 10:00 AM

Title: Capturing Data Uncertainty in High-Volume Stream Processing

CIDR 2009

Author(s): Yanlei Diao, Boduo Li, Anna Liu, Liping Peng, Charles Sutton, Thanh Tran, and Michael Zink

Available at: http://www-db.cs.wisc.edu/cidr/cidr2009/Paper_91.pdf

Discussion Leader: Rafael J. Fernandez-Moctezuma


Friday, 23 January 2009 @ 10:00 AM

Title: Graph Database Indexing Using Structured Graph Decomposition

ICDE 2007

Author(s): David W. Williams, Jun Huan, and Wei Wang

Available at: http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=4221746&isnumber=4221635

Discussion Leader: David Archer


Friday, 16 January 2009 @ 10:00 AM

Title: Data Management for High-Throughput Genomics

CIDR 2009

Author(s): Uwe Rohm and Jose A. Blakeley

Available at: http://www-db.cs.wisc.edu/cidr/cidr2009/Paper_31.pdf

Discussion Leader: David Maier


Friday, 05 December 2008 @ 10:00 AM

Title: On the Expressiveness of Implicit Provenance in Query and Update Languages

ICDT 2007

Author(s): P. Buneman, J. Cheney, S. Vansummeren

Available at: http://alpha.uhasselt.be/~lucg5855/papers/conference/icdt2007-implicitprovenance.pdf

Discussion Leader: David Archer


Friday, 21 November 2008 @ 10:00 AM

Title: Interactive Paper as a Reading Medium in Digital Libraries

ECDL 2008

Author(s): M. Morrie, B. Signer, N. Weibel

Available at: http://www.globis.ethz.ch/script/publication/download?docid=528

Discussion Leader: Jeremy Steinhauer


Friday, 14 November 2008 @ 10:00 AM

Title: Integrating Urban Form and Demographics in Water Demand MAnagement: An Empirical Case Study of Portland, Oregon (USA)

Author(s): Vivek Shandas, G. Hossein Parandavash

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Len Shapiro


Friday, 07 November 2008 @ 10:00 AM

Title: Clustera: An Integrated Computation and Data Management System

VLDB '08

Author(s): David J. DeWitt, Erik Paulson, Eric Robinson, et al.

Available at: http://www.cs.uwaterloo.ca/~ashraf/cs848/papers/Paper5.pdf

Discussion Leader: Kristen Tufte


Friday, 31 October 2008 @ 10:00 AM

Title: Schema Mapping Verification: The Spicy Way

EDBT '08

Author(s): A. Bonifati, G. Mecca, A. Pappalardo, et al.

Available at: http://doi.acm.org/10.1145/1353343.1353358

Discussion Leader: Nick Rayner


Friday, 24 October 2008 @ 10:00 AM

Title: Towards a Streaming SQL Standard

VLDB '08

Author(s): Namit Jain, Johannes Gehrke, Jennifer Widom, et al.

Available at: http://www.jgaa.info/~ugur/streamsql.pdf

Discussion Leader: Raphael J. Fernandez-Moctezuma


Friday, 10 October 2008 @ 10:00 AM

Title: Flying Fixed-Point: Recursive Processing in Stream Queries

Author(s): Johnathan Goldstein and David Maier

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: David Maier


Friday, 15 August 2008 @ 10:00 AM

Title: Flexible and Efficient IR Using Array Databases

The VLDB Journal, January 2008

Author(s): Roberto Cornacchia, Sandor Heman, Marcin Zukowski, Arjen P. Vries, Peter Boncz

Available at: http://dx.doi.org/10.1007/s00778-007-0071-0

Discussion Leader: Dave Maier


Friday, 08 August 2008 @ 10:00 AM

Title: Pay-as-You-Go User Feedback for Dataspace Systems

SIGMOD '08

Author(s): Shawn R. Jeffery, Michael J. Franklin, and Alon Y. Halevy

Available at: http://doi.acm.org/10.1145/1376616.1376701

Discussion Leader: Rafael J. Fernandez-Moctezuma


Friday, 01 August 2008 @ 10:00 AM

Title: Provenance Management in Curated Databases

SIGMOD '06

Author(s): Peter Buneman, Adriane Chapman, and James Cheney

Available at: http://doi.acm.org/10.1145/1142473.1142534

Discussion Leader: David Archer


Friday, 25 July 2008 @ 10:00 AM

Title: Bootstrapping Pay-as-You-Go Data Integration Systems

SIGMOD '08

Author(s): Anish Das Sarma, Xin Dong, and Alon Halevy

Available at: http://doi.acm.org/10.1145/1376616.1376702

Discussion Leader: Nick Rayner


Friday, 18 July 2008 @ 10:00 AM

Title: Extreme Visualization: Squeezing a Billion Records into a Million Pixels

SIGMOD '08

Author(s): Ben Shneiderman

Available at: http://doi.acm.org/10.1145/1376616.1376618

Discussion Leader: Len Shapiro


Friday, 11 July 2008 @ 10:00 AM

Title: A Visual Environment for Dynamic Web Application Composition

Proceedings of the Fourteenth ACM Conference on Hypertext and Hypermedia (HYPERTEXT '03)

Author(s): Kimihito Ito and Yuzuru Tanaka

Available at: http://doi.acm.org/10.1145/900051.900092

Discussion Leader: Lois Delcambre


Friday, 13 June 2008 @ 10:00 AM

Title: GPUTeraSort: High Performance Graphics Coprocessor Sorting for Large Database Management

SIGMOD '06

Author(s): Naga K. Govindaraju, Jim Gray, Ritesh Kumar, and Dinesh Manocha

Available at: http://research.microsoft.com/research/pubs/view.aspx?msr_tr_id=MSR-TR-2005-183

Discussion Leader: Len Shapiro


Friday, 06 June 2008 @ 10:00 AM

Title: A Security Punctuation Framework for Enforcing Access Control on Streaming Data

ICDE '08

Author(s): Rimma V. Nehme, Elke A. Rundensteiner, and Elisa Bertino

Available at: http://www.cs.purdue.edu/homes/rnehme/papers/icde08sp.pdf

Discussion Leader: Dave Maier


Friday, 30 May 2008 @ 10:00 AM

Title: Sketching Probabilistic Data Streams

SIGMOD '07

Author(s): Graham Cormode, and Minos Garofalakis

Available at: http://www.cs.berkeley.edu/~minos/Papers/sigmod07pstreams.pdf

Discussion Leader: Rafael J. Fernández-Moctezuma


Friday, 23 May 2008 @ 10:00 AM

Title: Sideways Information Passing for Push-Style Query Processing

ICDE '08

Author(s): Zachary G. Ives, and Nicholas E. Taylor

Available at: http://www.cis.upenn.edu/~zives/research/push.pdf

Discussion Leader: Vassilis Papadimos


Friday, 09 May 2008 @ 10:00 AM

Title: Update Exchange with Mappings and Provenance.

VLDB '07

Author(s): Green, T., Karvounarakis, G., Ives, Z., Tannen, V.

Available at: http://www.vldb.org/conf/2007/papers/research/p675-green.pdf

Discussion Leader: David Archer


Friday, 02 May 2008 @ 10:00 AM

Title: From Dirt to Shovels: Fully Automatic Tool Generation from Ad Hoc Data

POPL '08

Author(s): Kathleen Fisher, David Walker, Kenny Q. Zhu, and Peter White

Available at: http://doi.acm.org/10.1145/1328438.1328488

Discussion Leader: Nick Rayner


Friday, 25 April 2008 @ 10:00 AM

Title: Query-Aware Partitioning for Monitoring Massive Network Data Streams

SIGMOD '08

Author(s): Vladislav Shkapenyuk, Ted Johnson, Oliver Spatscheck, and S. Muthukrishnan

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Jin Li


Friday, 18 April 2008 @ 10:00 AM

Title: Bridging the Application and DBMS Profiling Divide for Database Application Developers

VLDB '07

Author(s): Surajit Chaudhuri, Vivek Narasayya, and Manoj Syamala

Available at: http://www.vldb.org/conf/2007/papers/industrial/p1252-chaudhuri.pdf

Discussion Leader: James Terwilliger


Friday, 11 April 2008 @ 10:00 AM

Title: Column-Stores vs. Row-Stores: How Different Are They Really?

SIGMOD 2008

Author(s): D. Abadi, S. Madden, N. Hachem

Available at: http://db.csail.mit.edu/pubs/ssbm.pdf

Discussion Leader: Kristin Tufte


Friday, 04 April 2008 @ 10:00 AM

Title: Web 3.0: Chicken Farms on the Semantic Web

Author(s): Jim Hendler

Available at: http://www.computer.org/portal/site/computer/menuitem.5d61c1d591162e4b0ef1bd108bcd45f3/index.jsp?&pName=computer_level1_article&TheCat=1075&path=computer/homepage/0108&file=webtech.xml&xsl=article.xsl&

Discussion Leader: Len Shapiro


Friday, 04 April 2008 @ 10:00 AM

Title: Paper-choosing session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone


Friday, 07 March 2008 @ 10:00 AM

Title: Continuous Queries in Oracle

VLDB '07

Author(s): Andrew Witkowski, Srikanth Bellamkonda, Hua-Gang Li, Vince Liang, Lei Sheng, Wayne Smith, Sankar Subramanian, James Terry, Tsae-Feng Yu

Available at: http://portal.acm.org/citation.cfm?id=1325851.1325985&coll=&dl=ACM&type=series&idx=SERIES11272&part=series&WantType=Proceedings&title=VLDB

Discussion Leader: Jin Li


Friday, 29 February 2008 @ 10:00 AM

Title: Database Virtualization: A New Frontier for Database Tuning and Physical Design

ICDE '07

Author(s): Soror, Ahmed A.; Aboulnaga, Ashraf; Salem, Kenneth

Available at: http://www.cs.uwaterloo.ca/~ashraf/pubs/smdb07virt.pdf

Discussion Leader: Len Shapiro


Friday, 22 February 2008 @ 10:00 AM

Title: Cooperative scans: dynamic bandwidth sharing in a DBMS

VLDB '07

Author(s): Marcin Zukowski and Sandor Heman and Niels Nes and Peter Boncz

Available at: http://portal.acm.org/ft_gateway.cfm?id=1325934&type=pdf&coll=&dl=ACM&CFID=11838413&CFTOKEN=90790934

Discussion Leader: Vassilis Papadimos


Friday, 15 February 2008 @ 10:00 AM

Title: Autonomously Semantifying Wikipedia

CIKM '07

Author(s): Fei Wu and Daniel Weld

Available at: http://doi.acm.org/10.1145/1321440.1321449

Discussion Leader: Susan Price


Friday, 08 February 2008 @ 10:00 AM

Title: Entity Resolution with Markov Logic

ICDM '06

Author(s): Parag Singla, and Pedro Domingos

Available at: http://ieeexplore.ieee.org/iel5/4053012/4053013/04053083.pdf?tp=&arnumber=4053083&isnumber=4053013

Discussion Leader: Nick Rayner


Friday, 01 February 2008 @ 10:00 AM

Title: Making database systems usable

SIGMOD '07

Author(s): H. V. Jagadish et al.

Available at: http://portal.acm.org/citation.cfm?id=1247480.1247483

Discussion Leader: James Terwilliger


Friday, 25 January 2008 @ 10:00 AM

Title: Consistency Sensitive Operators in CEDR

Author(s): Jonathan Goldstein, Mingsheng Hong, Mohamed Ali, and Roger Barga

Available at: http://research.microsoft.com/research/pubs/view.aspx?type=Technical%20Report&id=1401

Discussion Leader: Rafael Fernandez


Friday, 18 January 2008 @ 10:00 AM

Title: A Formal Characterization of PIVOT/UNPIVOT

CIKM '05

Author(s): Wyss, C., Robertson, E.

Available at: http://doi.acm.org/10.1145/1099554.1099709

Discussion Leader: David Archer


Friday, 11 January 2008 @ 10:00 AM

Title: Paper-choosing session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone


Friday, 07 December 2007 @ 10:00 AM

Title: Efficient Use of the Query Optimizer for Automated Physical Design

VLDB '07

Author(s): Stratos Papadomanolakis, Debabrata Dash, and Anastasia Ailamaki

Available at: http://www.cs.cmu.edu/~ddash/papers/inum_vldb.pdf

Discussion Leader: Vassilis Papadimos


Friday, 30 November 2007 @ 10:00 AM

Title: Conditional Functional Dependencies for Data Cleaning

ICDE '07

Author(s): Philip Bohannon, Wenfei Fan, Floris Geerts, Xibei Jia, and Anastasios Kementsietsidis

Available at: http://ieeexplore.ieee.org/xpls/abs_all.jsp?isnumber=4221635&arnumber=4221723&count=220&index=87

Discussion Leader: Jin Li


Friday, 16 November 2007 @ 10:00 AM

Title: Similarity Search: A Matching Based Approach

VLDB '06

Author(s): Anthony K. H. Tung, Rui Zhang, Nick Koudas, and Beng Chin Ooi

Available at: http://delivery.acm.org/10.1145/1170000/1164182/p631-tung.pdf?key1=1164182&key2=1971390911&coll=Portal&dl=ACM&CFID=956544&CFTOKEN=76804999

Discussion Leader: Rafael Fernandez


Friday, 02 November 2007 @ 10:00 AM

Title: FICSR: Feedback-based InConSistency Resolution and query processing

SIGMOD '07

Author(s): Yan Qi, K. Selcuk Candan, and Maria Luisa Sapino

Available at: http://doi.acm.org/10.1145/1247480.1247499

Discussion Leader: Nick Rayner


Friday, 26 October 2007 @ 10:00 AM

Title: Annotation Strategy

Author(s): Denise Bedford

Available at: http://ontolog.cim3.net/file/resource/reference/WorldBank-DeniseBedford-doc/Final_Annotation-Strategy.pdf

Discussion Leader: David Archer


Friday, 19 October 2007 @ 10:00 AM

Title: The Making of TPC-DS

VLDB '06

Author(s): Ragunath Othayoth, and Meikel Poess

Available at: http://www.vldb.org/conf/2006/p1049-othayoth.pdf

Discussion Leader: Len Shapiro


Friday, 12 October 2007 @ 10:00 AM

Title: A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data

VLDB '07

Author(s): Eric Chu, Akanksha Baid, Ting Chen, AnHai Doan, and Jeffrey F. Naughton

Available at: http://pages.cs.wisc.edu/~naughton/includes/papers/aRelationalApproach.pdf

Discussion Leader: David Maier


Friday, 05 October 2007 @ 10:00 AM

Title: Teaching a Schema Translator to Produce O/R Views

Author(s): Peter Mork, Philip A. Bernstein, and Sergey Melnik

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: James Terwilliger


Friday, 28 September 2007 @ 11:00 AM

Title: Paper-choosing session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone


Friday, 14 September 2007 @ 10:00 AM

Title: Scaling Games to Epic Proportions

SIGMOD '07

Author(s): Walker White, Alan Demers, Christoph Koch, Johannes Gehrke, and Rajmohan Rajagopalan

Available at: http://doi.acm.org/10.1145/1247480.1247486

Discussion Leader: Dave Maier


Friday, 07 September 2007 @ 10:00 AM

Title: Extracting entity profiles from semistructured information spaces

SIGMOD Rec. 26, 4 (Dec. 1997), 32-38

Author(s): Nado, R. A. and Huffman, S. B.

Available at: http://portal.acm.org/citation.cfm?id=271074.271083

Discussion Leader: David Archer


Friday, 24 August 2007 @ 10:00 AM

Title: Intensional Associations Between Data and Metadata

SIGMOD '07

Author(s): Divesh Srivastava, and Yannis Velegrakis

Available at: http://doi.acm.org/10.1145/1247480.1247526

Discussion Leader: Nick Rayner


Friday, 17 August 2007 @ 9:30 AM

Title: The Complex Dynamics of Collaborative Tagging

WWW 2007

Author(s): Harry Halpin, Valentin Robu, and Hana Shepherd

Available at: http://www2007.org/papers/paper635.pdf

Discussion Leader: Susan Price


Friday, 10 August 2007 @ 10:00 AM

Title: Scaling Up All Pairs Similarity Search

WWW 2007

Author(s): Roberto J. Bayardo, Yiming Ma, and Ramakrishnan Srikant

Available at: http://www2007.org/papers/paper342.pdf

Discussion Leader: Rafael Fernandez


Friday, 03 August 2007 @ 10:00 AM

Title: In-Memory Grid Files on Graphics Processors

DaMon '07

Author(s): Ke Yang, Bingsheng He, Rui Fang, Mian Lu, Naga Govindaraju, Qiong Luo, Pedro Sander, and Jiaoying Shi

Available at: http://www.cs.cmu.edu/~damon2007/pdf/yang07inmemorygrid.pdf

Discussion Leader: Kristin Tufte


Friday, 27 July 2007 @ 9:00 AM

Title: Map-reduce-merge: simplified relational data processing on large clusters

SIGMOD '07

Author(s): Hung-chih Yang, Ali Dasdan, Ruey-Lung Hsiao, and D. Stott Parker

Available at: http://doi.acm.org/10.1145/1247480.1247602

Discussion Leader: Len Shapiro


Friday, 20 July 2007 @ 10:00 AM

Title: iTrails: Pay-as-you-go Information Integration in Dataspaces

VLDB '07

Author(s): Marcos Antonio Vaz Salles, Jens-Peter Dittrich, Shant Kirakos Karakashian, Olivier René Girard, and Lukas Blunschi

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Dave Maier


Friday, 13 July 2007 @ 10:00 AM

Title: The five-minute rule twenty years later, and how flash memory changes the rules

DaMon '07

Author(s): Goetz Graefe

Available at: http://www.cs.cmu.edu/~damon2007/pdf/graefe07fiveminrule.pdf

Discussion Leader: Vassilis Papadimos


Friday, 15 June 2007 @ 10:00 AM

Title: Sharing Aggregate Computation for Distributed Queries

SIGMOD '07

Author(s): Ryan Huebsch, Minos Garofalakis, Joseph M. Hellerstein, and Ion Stoica

Available at: http://db.cs.berkeley.edu/papers/sigmod07-aggshare.pdf

Discussion Leader: Vassilis Papadimos


Friday, 08 June 2007 @ 10:00 AM

Title: Personalized Queries under a Generalized Preference Model

ICDE '05

Author(s): Georgia Koutrika and Yannis Ioannidis

Available at: http://ieeexplore.ieee.org/iel5/9680/30564/01410197.pdf

Discussion Leader: Len Shapiro


Friday, 01 June 2007 @ 10:00 AM

Title: Toward Entity Retrieval over Structured and Text Data

WIRD '04

Author(s): Sayyadian, M., Shakery, A., Doan, A., Zhai, C.

Available at: http://www.cs.wisc.edu/~mayssam/files/wird04_integrating-db-ir.pdf

Discussion Leader: David Archer


Friday, 25 May 2007 @ 10:00 AM

Title: MoBIoS: a Metric-Space DBMS to Support Biological Discovery

SSDBM '03

Author(s): Daniel Miranker, Weijia Xu, and Rui Mao

Available at: http://www.cs.utexas.edu/users/mobios/mobios_papers/2003-MoBIoS-Final.pdf

Discussion Leader: David Maier


Friday, 18 May 2007 @ 10:00 AM

Title: Efficient Reverse k-Nearest Neighbor Search in Arbitrary Metric Spaces

SIGMOD '06

Author(s): Elke Achtert, Christian Bohm, Peer Kroger, Peter Kunath, Alexey Pryakhin, and Matthias Renz

Available at: http://www.dbs.informatik.uni-muenchen.de/Publikationen/Papers/SIGMOD06_rknn.pdf

Discussion Leader: Rafael Fernandez


Friday, 11 May 2007 @ 10:00 AM

Title: Safety guarantee of continuous join queries over punctuated data streams

VLDB '06

Author(s): Hua-Gang Li, Songting Chen, Junichi Tatemura, Divyakant Agrawal, Selcuk Kandan, and Wang-Pin Hsiung

Available at: http://portal.acm.org/citation.cfm?id=1164131&dl=ACM&coll=&CFID=15151515&CFTOKEN=6184618

Discussion Leader: Jin Li


Friday, 04 May 2007 @ 10:00 AM

Title: Realizing Parallelism in Database Operations: Insights from a Massively Multithreaded Architecture

Author(s): John Cieslewicz, Jonathan Berry, Bruce Hendrickson, Kenneth A. Ross

Available at: http://www.cs.cmu.edu/~damon2006/pdf/cieslewicz06parallelism.pdf

Discussion Leader: Kristin Tufte


Friday, 27 April 2007 @ 10:00 AM

Title: Compiling Mappings to Bridge Applications and Databases

SIGMOD '07

Author(s): Sergey Melnik, Atul Adya, and Phil Bernstein

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: James Terwilliger


Friday, 20 April 2007 @ 10:00 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Susan Price


Friday, 13 April 2007 @ 10:00 AM

Title: A Relational View of the Semantic Web

Author(s): Andrew Newman

Available at: http://www.xml.com/pub/a/2007/03/14/a-relational-view-of-the-semantic-web.html

Discussion Leader: Lois Delcambre


Friday, 13 April 2007 @ 10:00 AM

Title: SPARQL Query Language for RDF

Author(s): W3C Candidate Recommendation

Available at: http://www.w3.org/TR/2006/CR-rdf-sparql-query-20060406/

Discussion Leader: Lois Delcambre


Friday, 23 March 2007 @ 10:00 AM

Title: SEMEX: Toward On-the-fly Personal Information Integration

VLDB II Web workshop 2004

Author(s): Dong, X., Halevy, A., Nemes, E., Sigurdsson, S., Domingos, P.

Available at: http://data.cs.washington.edu/semex/semex_iiweb.pdf

Discussion Leader: David Archer


Friday, 09 March 2007 @ 10:00 AM

Title: Declarative networking: language, execution and optimization

SIGMOD '06

Author(s): Boon Thau Loo, Tyson Condie, Minos Garofalakis, David E. Gay, Joseph M. Hellerstein, Petros Maniatis, Raghu Ramakrishnan, Timothy Roscoe, and Ion Stoica

Available at: http://doi.acm.org/10.1145/1142473.1142485

Discussion Leader: David Maier


Friday, 02 March 2007 @ 10:00 AM

Title: Consistent Streaming Through Time: A Vision for Event Stream Processing

CIDR '07

Author(s): Roger Barga, Jonathan Goldstein, Mohamed Ali, and Mingsheng Hong

Available at: http://complexevents.com/?p=173

Discussion Leader: Jin Li


Friday, 23 February 2007 @ 10:00 AM

Title: A Heterogeneous Field Matching Method for Record Linkage

ICDM '05

Author(s): S. Minton, C. Nanjo, C. A. Knoblock, M. Michalowski, and M. Michelson

Available at: http://ieeexplore.ieee.org/iel5/10470/33217/01565694.pdf?tp=&arnumber=1565694&isnumber=33217

Discussion Leader: Nick Rayner


Friday, 16 February 2007 @ 10:00 AM

Title: Life Beyond Distributed Transactions: An Apostate's Opinion

CIDR '07

Author(s): Pat Helland

Available at: http://www-db.cs.wisc.edu/cidr/cidr2007/papers/P15.pdf

Discussion Leader: Vassilis Papadimos


Friday, 09 February 2007 @ 10:00 AM

Title: A language modeling approach to information retrieval

SIGIR '98

Author(s): Jay M. Ponte, W. Bruce Croft

Available at: http://portal.acm.org/citation.cfm?id=291008

Discussion Leader: Susan Price


Friday, 02 February 2007 @ 10:00 AM

Title: Community-Driven Ontology Matching

ESWC 2006

Author(s): Zhdanova, A., Shvaiko, P.

Available at: http://www.ee.surrey.ac.uk/Personal/A.Zhdanova/papers/eswc06_ontology_matching.pdf

Discussion Leader: Lois Delcambre


Friday, 26 January 2007 @ 10:00 AM

Title: Moirae: History-Enhanced Monitoring

CIDR '07

Author(s): Magdalena Balazinska, YongChul Kwon, Nathan Kuchta, and Dennis Lee

Available at: http://www-db.cs.wisc.edu/cidr/cidr2007/papers/P43.pdf

Discussion Leader: Kristin Tufte


Friday, 19 January 2007 @ 10:00 AM

Title: Relational Lenses: A Language for Updatable Views

PODS '06

Author(s): Aaron Bohannon, Benjamin Pierce, and Jeffrey A. Vaughan

Available at: http://www.cis.upenn.edu/~bcpierce/papers/dblenses-pods.pdf

Discussion Leader: James Terwilliger


Friday, 12 January 2007 @ 10:00 AM

Title: Paper-choosing session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader:


Friday, 08 December 2006 @ 10:00 AM

Title: Using ECA Rules to Implement Mobile Query Agents for Fast-Evolving Pure P2P Database Systems

MDM '06

Author(s): Verena Kantere, and Aris Tsois

Available at: http://delivery.acm.org/10.1145/1080000/1071271/p164-kantere.pdf?key1=1071271&key2=3344059511&coll=Portal&dl=ACM&CFID=1953853&CFTOKEN=85711509

Discussion Leader: Rafael J. Fernandez-Moctezuma


Friday, 01 December 2006 @ 10:00 AM

Title: Sudokus as Logical Puzzles

DISPROVING '06

Author(s): Thomas Hillenbrand, Dalibor Topic, and Christoph Weidenback

Available at: http://www.mpi-sb.mpg.de/~hillen/documents/HTW06.ps

Discussion Leader: Nick Rayner


Friday, 17 November 2006 @ 10:00 AM

Title: Textpresso: An Ontology-Based Information Retrieval and Extraction System for Biological Literature

PLoS Biology

Author(s): Hans-Michael Muller, Eimear E. Kenny, and Paul W. Sternberg

Available at: http://biology.plosjournals.org/perlserv?request=get-pdf&file=10.1371_journal.pbio.0020309-L.pdf

Discussion Leader: David Maier


Friday, 03 November 2006 @ 10:00 AM

Title: Magnet: Supporting Navigation in Semistructured Data Environments

Author(s): Sinha, V., Karger, D.

Available at: http://haystack.lcs.mit.edu/papers/magnet-sigmod2005.pdf

Discussion Leader: David Archer


Friday, 27 October 2006 @ 10:00 AM

Title: Modeling Skew in Data Streams

SIGMOD '06

Author(s): Flip Korn, S. Muthukrishnan, and Yihua Wu

Available at: http://portal.acm.org/citation.cfm?id=1142473.1142495

Discussion Leader: Jin Li


Friday, 20 October 2006 @ 10:00 AM

Title: User performance versus precision measures for simple search tasks

SIGIR '06

Author(s): Andrew Turpin and Falk Scholer

Available at: http://portal.acm.org/citation.cfm?id=1148176

Discussion Leader: Susan Price


Friday, 06 October 2006 @ 10:00 AM

Title: User-defined aggregate functions: bridging theory and practice

SIGMOD '06

Author(s): Sara Cohen

Available at: http://doi.acm.org/10.1145/1142473.1142480

Discussion Leader: James Terwilliger


Friday, 15 September 2006 @ 10:00 AM

Title: Privacy enhancing identity management: protection against re-identification and profiling

Author(s): Sebastian Clauss, Dogan Kesdogan, Tobias Kolsch

Available at: http://doi.acm.org/10.1145/1102486.1102501

Discussion Leader: Nick Rayner


Friday, 08 September 2006 @ 10:00 AM

Title: On redundancy vs dependency preservation in normalization: an information-theoretic study of 3NF

PODS '06

Author(s): Solmaz Kolahi, and Leonid Libkin

Available at: http://doi.acm.org/10.1145/1142351.1142369

Discussion Leader: David Maier


Friday, 01 September 2006 @ 10:00 AM

Title: Personalizing Electronic Books

Author(s): Ohene-Djan, James, and Fernandes, Alvaro A. A.

Available at: http://jodi.tamu.edu/Articles/v03/i04/Ohene-Djan/

Discussion Leader: David Archer


Friday, 25 August 2006 @ 10:00 AM

Title: To Search or to Crawl? Towards a Query Optimizer for Text-Centric Tasks

SIGMOD '06

Author(s): P. Ipeirotis, E. Agichtein, P. Jain, L. Gravano

Available at: http://www1.cs.columbia.edu/~gravano/Papers/2006/sigmod06.pdf

Discussion Leader: Laura Bright


Friday, 18 August 2006 @ 10:00 AM

Title: Buffer Pool Aware Query Optimization

CIDR '05

Author(s): Ravishankar Ramamurthy, and David J. DeWitt

Available at: http://www-db.cs.wisc.edu/cidr/cidr2005/papers/P21.pdf

Discussion Leader: Len Shapiro


Friday, 11 August 2006 @ 10:00 AM

Title: How to cite curated databases and how to make them citable

Author(s): Peter Buneman

Available at: http://homepages.inf.ed.ac.uk/opb/homepagefiles/harmarnew.pdf

Discussion Leader: Bill Howe


Friday, 04 August 2006 @ 10:00 AM

Title: Window-aware Load Shedding for Aggregation Queries over Data Streams

VLDB '06

Author(s): Nesime Tatbul, and Stan Zdonik

Available at: http://www.cs.brown.edu/~tatbul/publications/vldb06.pdf

Discussion Leader: Jin Li


Friday, 28 July 2006 @ 10:00 AM

Title: Relaxed Currency Serializability for Middle-Tier Caching and Replication

SIGMOD '06

Author(s): P. A. Bernstein, A. Fekete, H. Guo, R. Ramakrishnan, P. Tamma

Available at: http://doi.acm.org/10.1145/1142473.1142540

Discussion Leader: Vassilis Papadimos


Friday, 21 July 2006 @ 10:00 AM

Title: L-Diversity: Privacy Beyond k-Anonymity

ICDE '06

Author(s): Machanavajjhala, Gehrke, Kifer, Venkitasubramaniam

Available at: http://www.cs.cornell.edu/johannes/papers/2006/2006-icde-publishing.pdf

Discussion Leader: James Terwilliger


Friday, 14 July 2006 @ 10:00 AM

Title: Incremental Test Collections

CIKM '05

Author(s): Ben Carterette and James Allan

Available at: http://doi.acm.org/10.1145/1099554.1099723

Discussion Leader: Susan Price


Friday, 07 July 2006 @ 10:00 AM

Title: Paper-choosing session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone


Friday, 16 June 2006 @ 10:00 AM

Title: RAM: Array Processing over a Relational DBMS

CWI Tech. Report

Author(s): A. R. van Ballegooij, A. P. de Vries, M. L. Kersten

Available at: http://www.cwi.nl/ftp/CWIreports/INS/INS-R0301.pdf

Discussion Leader: Bill Howe


Friday, 09 June 2006 @ 10:00 AM

Title: Cracking the Database Store

CIDR '05

Author(s): Martin Kersten and Stefan Manegold

Available at: http://www-db.cs.wisc.edu/cidr/cidr2005/papers/P18.pdf

Discussion Leader: David Maier


Friday, 02 June 2006 @ 10:00 AM

Title: Supporting Exploratory Search (Communications of the ACM Special Issue)

Author(s): Ryen W. White, Bill Kules, Steven M. Drucker, and M. C. Schraefel (eds.)

Available at: http://portal.acm.org/toc.cfm?id=1121949&type=issue&coll=ACM&dl=ACM&CFID=73163798&CFTOKEN=29872594#1121977

Discussion Leader: Susan Price


Friday, 26 May 2006 @ 10:00 AM

Title: Clean Answers Over Dirty Databases

ICDE '06

Author(s): Periklis Andritsos, Ariel Fuxman, and Rene Miller

Available at: http://www.cs.toronto.edu/~afuxman/publications/icde06.pdf

Discussion Leader: James Terwilliger


Friday, 19 May 2006 @ 11:00 AM

Title: Declarative Querying for Biological Sequences

ICDE 2006

Author(s): S. Tata, J. Patel, J. Friedman, A. Swaroop

Available at: http://www.eecs.umich.edu/~jignesh/publ/SEQ.pdf

Discussion Leader: Laura Bright


Friday, 12 May 2006 @ 10:00 AM

Title: Tackling inconsistencies in data integration through source preferences

Author(s): G. De Giacomo, D. Lembo, M. Lenzerini, R. Rosati

Available at: http://doi.acm.org/10.1145/1012453.1012459

Discussion Leader: Nick Rayner


Friday, 05 May 2006 @ 10:00 AM

Title: Transaction Time Support Inside a Database Engine

ICDE 2006

Author(s): David Lomet, Roger Barga, Mohamed F. Mokbel, German Shegalov, Rui Wang, Yunye Zhu

Available at: ftp://ftp.research.microsoft.com/users/lomet/pub/ImmortalDB-ICDE6.pdf

Discussion Leader: Vassilis Papadimos


Friday, 28 April 2006 @ 10:00 AM

Title: Updates Through Views: A New Hope

Author(s): Yannis Kotidis, Divesh Srivastava, and Yannis Velegrakis

Available at: http://public.research.att.com/~velgias/papers/KSV06.pdf

Discussion Leader: David Archer


Friday, 21 April 2006 @ 10:00 AM

Title: Declarative Network Monitoring with an Underprovisioned Query Processor

ICDE '06

Author(s): Frederick Reiss and Joseph Hellerstein

Available at: http://www.cs.berkeley.edu/~phred/pubs/data_triage_icde_2006.pdf

Discussion Leader: Jin Li


Friday, 14 April 2006 @ 10:00 AM

Title: Interconnections in multi-core architectures: Understanding Mechanisms, Overheads and Scaling

Author(s): Rakesh Kumar, Victor Zyuban, Dean Tullsen

Available at: http://www.cse.ucsd.edu/~rakumar/isca05.pdf

Discussion Leader: Kristin Tufte


Friday, 07 April 2006 @ 10:00 AM

Title: Why Do Computers Stop and What Can Be Done About It

Author(s): J. Gray

Available at: http://research.microsoft.com/~gray/papers/TandemTR85.7_WhyDoComputersStop.pdf

Discussion Leader: David Maier


Friday, 24 March 2006 @ 10:00 AM

Title: A Report on the NSF-Sponsored Workshop on Personal Information Management, Seattle, WA, 2005

Author(s): William Jones, Harry Bruce

Available at: http://www.ischool.washington.edu/pim/report%20NSF%20PIM%20workshop%20Seattle%202005%20draft.pdf

Discussion Leader: Len Shapiro


Friday, 17 March 2006 @ 10:00 AM

Title: Unpublished draft

Author(s): U. C. Berkeley

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Kristin Tufte


Friday, 10 March 2006 @ 10:00 AM

Title: C-Store: A Column-oriented DBMS

VLDB '05

Author(s): Mike Stonebraker, Daniel Abadi, Adam Batkin, Xuedong Chen, Mitch Cherniack, Miguel Ferreira, Edmond Lau, Amerson Lin, Sam Madden, Elizabeth O'Neil, Pat O'Neil, Alex Rasin, Nga Tran, Stan Zdonik

Available at: http://www.vldb2005.org/program/paper/thu/p553-stonebraker.pdf

Discussion Leader: Laura Bright


Friday, 03 March 2006 @ 10:00 AM

Title: Enhancing P2P File-Sharing with an Internet-Scale Query Processor

VLDB '04

Author(s): Boon Thau Loo, Joseph M. Hellerstein, Ryan Huebsch, Scott Shenker, Ion Stoica

Available at: http://www.vldb.org/conf/2004/RS11P2.PDF

Discussion Leader: Vassilis Papadimos


Thursday, 23 February 2006 @ 5:00 PM

Title: An Information Network Overlay Architecture for the NSDL

Author(s): Carl Lagoze, Dean B. Krafft, Susan Jesuroga, Tim Cornwell, Ellen J. Cramer, and Eddie Shin

Available at: http://arxiv.org/abs/cs.DL/0501080

Discussion Leader: Lois Delcambre


Friday, 17 February 2006 @ 10:00 AM

Title: Information-Theoretic Tools for Mining Database Structure from Large Data Sets

SIGMOD '04

Author(s): Periklis Andritsos, Renee J. Miller, and Panayiotis Tsaparas

Available at: http://doi.acm.org/10.1145/1007568.1007650

Discussion Leader: Nick Rayner


Friday, 10 February 2006 @ 10:00 AM

Title: Compiled Query Execution Engine using JVM

To appear in ICDE 2006

Author(s): Jun Rao, Hamid Pirahesh, C. Mohan, Guy Lohman

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Sudarshan Murthy


Friday, 03 February 2006 @ 10:00 AM

Title: Blueprints for ETL workflows

This is a longer version of a paper presented in ER 2005

Author(s): P. Vassiliadis, A. Simitsis, M. Terrovitis, S. Skiadopoulos

Available at: http://www.cs.uoi.gr/~pvassil/publications/2005_ER_AG/ETL_blueprints_long.pdf

Discussion Leader: James Terwilliger


Friday, 27 January 2006 @ 10:00 AM

Title: A Distributed Event Delivery Method with Load Balancing for MMORPG

Author(s): Shinya Yamamoto, Yoshihiro Murata, Keiichi Yasumoto, Minoru Ito

Available at: http://www.research.ibm.com/netgames2005/papers/yamamoto.pdf

Discussion Leader: Bill Howe


Friday, 27 January 2006 @ 10:00 AM

Title: Dynamic Microcell Assignment for Massively Multiplayer Online Gaming

Author(s): Bart De Vleeschauwer, Bruno Van Den Bossche, Tom Verdickt, Filip De Turck, Bart Dhoedt, Piet Demeester

Available at: http://www.research.ibm.com/netgames2005/papers/devleesch.pdf

Discussion Leader: Bill Howe


Friday, 20 January 2006 @ 10:00 AM

Title: Reference reconciliation in complex information spaces

Author(s): Xin Dong, Alon Halevy, and Jayant Madhavan

Available at: http://delivery.acm.org/10.1145/1070000/1066168/p85-dong.pdf?key1=1066168&key2=5059817311&coll=ACM&dl=ACM&CFID=65499903&CFTOKEN=59582496

Discussion Leader: Susan Price


Friday, 13 January 2006 @ 10:00 AM

Title: Paper-choosing session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader:


Friday, 09 December 2005 @ 10:00 AM

Title: The Google File System

SOSP 2003

Author(s): Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung

Available at: http://www.cs.rochester.edu/sosp2003/papers/p125-ghemawat.pdf

Discussion Leader: David Maier


Friday, 02 December 2005 @ 10:00 AM

Title: Analyzing Plan Diagrams of Database Query Optimizers

VLDB '05

Author(s): Naveen Reddy and Jayant Haritsa

Available at: http://www.vldb2005.org/program/paper/fri/p1228-reddy.pdf

Discussion Leader: Len Shapiro


Friday, 18 November 2005 @ 10:00 AM

Title: A Framework for Reliable and Efficient Data Placement in Distributed Computing Systems

To appear in Journal of Parallel and Distributed Computing, (2005)

Author(s): Tevfik Kosar and Miron Livny

Available at: http://www.cs.wisc.edu/condor/stork/papers/data_subsystem-jpdc05.pdf

Discussion Leader: Laura Bright


Friday, 04 November 2005 @ 10:00 AM

Title: Semantic File Systems

Author(s):

Available at: http://www.objs.com/survey/OFSExt.htm

Discussion Leader: Eric Hanson


Friday, 28 October 2005 @ 10:00 AM

Title: MDL Summarization with Holes

VLDB '05

Author(s): Shaofeng Bu, Laks V.S. Lakshmanan, and Raymond T. Ng

Available at: http://www.vldb2005.org/program/paper/wed/p433-bu.pdf

Discussion Leader: Vassilis Papadimos


Friday, 21 October 2005 @ 10:00 AM

Title: Scientific Data Management in the Coming Decade

Microsoft Tech. Report

Author(s): Jim Gray, David T. Liu, Maria A. Nieto-Santisteban, Alexander S. Szalay, Gerd Heber, and David DeWitt

Available at: http://research.microsoft.com/research/pubs/view.aspx?tr_id=860

Discussion Leader: Bill Howe


Friday, 21 October 2005 @ 10:00 AM

Title: Where the Rubber Meets the Sky: Bridging the Gap between Databases and Science

IEEE Data Engineering Bulletin, December 2004

Author(s): Jim Gray and Alex Szalay

Available at: http://research.microsoft.com/research/pubs/view.aspx?tr_id=815

Discussion Leader: Bill Howe


Friday, 14 October 2005 @ 10:00 AM

Title: A Three-Layered XML View Model: A Practical Approach

ER '05

Author(s): Rajugan R., Elizabeth Chang, Tharam S. Dillon, and Ling Feng

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Sun Murthy


Friday, 07 October 2005 @ 10:00 AM

Title: Declarative Data Cleaning: Language, Model, and Algorithms

VLDB '01

Author(s): Helena Galhardas, Daniela Florescu, Dennis Shasha, Eric Simon, and Cristian-Augustin Saita

Available at: http://www.vldb.org/conf/2001/P371.pdf

Discussion Leader: James Terwilliger


Friday, 02 September 2005 @ 10:00 AM

Title: Towards a semantics for XML markup

Author(s): Allen Renear, David Dubin, and C. M. Sperberg-McQueen

Available at: http://doi.acm.org/10.1145/585058.585081

Discussion Leader: Susan Price


Friday, 02 September 2005 @ 10:00 AM

Title: Methods for the semantic analysis of document markup

Author(s): P. S. Bayerl et al.

Available at: http://doi.acm.org/10.1145/958220.958250

Discussion Leader: Susan Price


Wednesday, 31 August 2005 @ 4:15 PM

Title: DBRG hip-hop session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone


Friday, 26 August 2005 @ 10:00 AM

Title: Principled Design of the Modern Web Architecture

ICSE 2000

Author(s): Roy T. Fielding, and Richard N. Taylor

Available at: http://doi.acm.org/10.1145/337180.337228

Discussion Leader: Sudarshan Murthy/Eric Hanson


Wednesday, 24 August 2005 @ 4:15 PM

Title:

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Dave Maier


Friday, 19 August 2005 @ 10:00 AM

Title: Sampling Algorithms in a Stream Operator

SIGMOD '05

Author(s): Theodore Johnson, S. Muthukrishnan, and Irina Rozenbaum

Available at: http://doi.acm.org/10.1145/1066157.1066159

Discussion Leader: Kristin Tufte


Wednesday, 17 August 2005 @ 4:15 PM

Title: The Object-Oriented Database System Manifesto

Author(s): M. Atkinson, F. Bancilhon, D. DeWitt, K. Dittrich, D. Maier, and S. Zdonik

Available at: http://www-2.cs.cmu.edu/afs/cs.cmu.edu/user/clamen/OODBMS/Manifesto/

Discussion Leader: Lois Delcambre


Wednesday, 17 August 2005 @ 4:15 PM

Title: Third-generation Database System Manifesto

Author(s): The Committee for Advanced DBMS Function

Available at: http://doi.acm.org/10.1145/101077.390001

Discussion Leader: Lois Delcambre


Friday, 12 August 2005 @ 10:00 AM

Title: Stacked indexed views in microsoft SQL server

SIGMOD '05

Author(s): David DeHaan, Per-Ake Larson, and Jingren Zhou

Available at: http://doi.acm.org/10.1145/1066157.1066179

Discussion Leader: David Maier


Wednesday, 10 August 2005 @ 4:15 PM

Title: A database perspective on the semantic web: a brief commentary

Author(s): Frank Manola

Available at: http://sites.computer.org/debull/A03dec/manola.ps

Discussion Leader: Nick Rayner


Friday, 05 August 2005 @ 10:00 AM

Title: Automatic Performance Diagnosis and Tuning in Oracle

Author(s): Karl Dias, Mark Ramacher, Uri Shaft, Venkateshwaran Venkataramani, and Graham Wood

Available at: http://www.cs.pdx.edu/~len/AutomaticOracleTuning.pdf

Discussion Leader: Len Shapiro


Wednesday, 03 August 2005 @ 4:15 PM

Title:

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Chris Dubay


Friday, 29 July 2005 @ 10:00 AM

Title: Update-Pattern-Aware Modeling and Processing of Continuous Queries

SIGMOD '05

Author(s): Lukasz Golab, and M. Tamer Ozsu

Available at: http://doi.acm.org/10.1145/1066157.1066232

Discussion Leader: Jin Li


Wednesday, 20 July 2005 @ 4:15 PM

Title: Database systems: achievements and opportunities

Author(s): Avi Silberschatz, Michael Stonebraker, Jeff Ullman

Available at: http://doi.acm.org/10.1145/125223.125272

Discussion Leader: Susan Price


Wednesday, 20 July 2005 @ 4:15 PM

Title: The Lowell Database Research Self-Assessment Meeting

Author(s):

Available at: http://research.microsoft.com/~gray/lowell/

Discussion Leader: Susan Price


Friday, 15 July 2005 @ 10:00 AM

Title: System RX: One Part Relational, One Part XML

SIGMOD '05

Author(s): K. Beyer et al.

Available at: http://doi.acm.org/10.1145/1066157.1066197

Discussion Leader: Laura Bright


Wednesday, 13 July 2005 @ 4:15 PM

Title: On Six Degrees of Separation in DBLP-DB and More

Author(s): E. Elmacioglu and D. Lee

Available at: http://www.sigmod.org/sigmod/record/issues/0506/p33-article-lee.pdf

Discussion Leader: Laura Bright


Friday, 08 July 2005 @ 10:00 AM

Title: A Graphical Language for Relational Multi-Database Querying and Restructuring

ICCI '98

Author(s): Fereidoon Sadri and Patrick L. Shouse

Available at: http://citeseer.ist.psu.edu/263426.html

Discussion Leader: James Terwilliger


Wednesday, 06 July 2005 @ 4:15 PM

Title: You and your Research

Author(s): Richard Hamming (transcribed by J. F. Kaiser)

Available at: http://www.cs.virginia.edu/~robins/YouAndYourResearch.html

Discussion Leader: Vassilis Papadimos


Friday, 01 July 2005 @ 10:00 AM

Title: Relational confidence bounds are easy with the bootstrap

SIGMOD '05

Author(s): Abhijit Pol and Christopher Jermaine

Available at: http://doi.acm.org/10.1145/1066157.1066224

Discussion Leader: Vassilis Papadimos


Wednesday, 29 June 2005 @ 4:15 PM

Title: Rooter: A Methodology for the Typical Unification of Access Points and Redundancy

Author(s): Jeremy Stribling, Daniel Aguayo and Maxwell Krohn

Available at: http://pdos.csail.mit.edu/scigen/rooter.pdf

Discussion Leader: James Terwilliger


Friday, 24 June 2005 @ 10:00 AM

Title: Paper-choosing session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader:


Friday, 10 June 2005 @ 10:00 AM

Title: A Heartbeat Mechanism and its Application in Gigascope

To appear in VLDB 2005

Author(s): Theodore Johnson, S. Muthukrishnan, Vladislav Shkapenyuk, and Oliver Spatscheck

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Dave Maier


Friday, 03 June 2005 @ 10:00 AM

Title: Applying Model Management to Classical Meta Data Problems

CIDR '03

Author(s): Philip Bernstein

Available at: http://www.research.microsoft.com/~philbe/PBernsteinCIDR12ext.pdf

Discussion Leader: James Terwilliger


Friday, 27 May 2005 @ 10:00 AM

Title: A Game Theoretic Framework for Incentives in P2P Systems

P2P '03

Author(s): Chiranjeeb Buragohain, Divyakant Agrawal

Available at: http://www.cs.ucsb.edu/~suri/psdir/incentives.pdf

Discussion Leader: Vassilis Papadimos


Friday, 20 May 2005 @ 10:00 AM

Title: Robust and Fast Similarity Search for Moving Object Trajectories

Author(s): Lei Chen, Tamer Ozsu, Vincent Oria

Available at: http://db.uwaterloo.ca/~ddbms/publications/multimedia/sigmod05-leichen.pdf

Discussion Leader: Bill Howe


Friday, 13 May 2005 @ 10:00 AM

Title: Vision Paper: Enabling Privacy for the Paranoids

Author(s): G. Aggarwal et al.

Available at: http://dbpubs.stanford.edu:8090/pub/2004-41

Discussion Leader: Laura Bright


Friday, 13 May 2005 @ 10:00 AM

Title: Privacy-preserving data integration and sharing

DMKD'04

Author(s): Chris Clifton, Murat Kantarcioglu, AnHai Doan, Gunther Schadow, Jaideep Vaidya, Ahmed Elmagarmind, Dan Suciu

Available at: http://doi.acm.org/10.1145/1008694.1008698

Discussion Leader: Nick Rayner


Friday, 06 May 2005 @ 10:00 AM

Title: QPipe: A Simultaneously Pipelined Relational Query Engine.

SIGMOD '05

Author(s): S. Harizopoulos, V. Shkapenyuk, A. Ailamaki.

Available at: http://www-2.cs.cmu.edu/~stavros/publications/qpipe.pdf

Discussion Leader: Kristin Tufte


Friday, 29 April 2005 @ 10:00 AM

Title: An Evaluation of Non-Equijoin Algorithms

VLDB 1991

Author(s): David J. Dewitt, Jeffrey F. Naughton, Donovan A. Schneider

Available at: http://www.vldb.org/conf/1991/P443.PDF

Discussion Leader: Jin Li


Friday, 22 April 2005 @ 10:00 AM

Title: Search Middleware and the Simple Digital Library Interoperability Protocol

Author(s): Paepcke, A., Brandriff, R., Janee, G., Larson, R., Ludaescher, B., Melnik, S., Raghavan, S.

Available at: http://www.dlib.org/dlib/march00/paepcke/03paepcke.html

Discussion Leader: Sun Murthy


Friday, 22 April 2005 @ 10:00 AM

Title: Core Services in the Architecture of the National Science Digital Library

JCDL '02

Author(s): Carl Lagoze, Walter Hoehn, David Millman, William Arms, Stoney Gan, Diane Hillmann, Christopher Ingram, Dean Krafft, Richard Marisa, Jon Phipps, John Saylor, Carol Terrizzi, Allan, Sergio Guzman-Lara, Tom Kalt

Available at: http://portal.acm.org/citation.cfm?id=544220.544264

Discussion Leader: Sun Murthy


Friday, 15 April 2005 @ 10:00 AM

Title: Implementing A Scalable XML Publish/Subscribe System Using Relational Database Systems

SIGMOD 2004

Author(s): Tian, Reinwald, Pirahesh, Mayr, and Myllymaki

Available at: http://portal.acm.org/citation.cfm?id=1007623

Discussion Leader: Susan Price


Friday, 08 April 2005 @ 10:00 AM

Title: RQL: A Declarative Query Language for RDF

Author(s): Gregory Karvounarakis, Vassilis Christophides, Sofia Alexaki, Dimitris Plexousakis, Michel Scholl

Available at: http://139.91.183.30:9090/RDF/publications/www2002/www2002.pdf

Discussion Leader: Lois Delcambre


Friday, 01 April 2005 @ 10:00 AM

Title: Paper choosing session

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader:


Friday, 18 March 2005 @ 10:00 AM

Title: (Almost) Hands-off Information Integration for the Life Sciences

CIDR '05

Author(s): U. Leser, and F. Naumann

Available at: http://www-db.cs.wisc.edu/cidr/papers/P11.pdf

Discussion Leader: Laura Bright


Friday, 11 March 2005 @ 10:00 AM

Title: Using Probabilistic Models for Data Management in Acquisitional Environments

CIDR '05

Author(s): A. Desphande, C. Guestrin, and S. Madden

Available at: http://www-db.cs.wisc.edu/cidr/papers/P26.pdf

Discussion Leader: Kristin Tufte


Friday, 04 March 2005 @ 10:00 AM

Title: Towards a theory of natural language interfaces to databases

Author(s): Ana-Maria Popescu, Oren Etzioni, and Henry Kautz

Available at: http://doi.acm.org/10.1145/604045.604070

Discussion Leader: Nick Rayner


Friday, 25 February 2005 @ 10:00 AM

Title: The WebDAV Property Design

Author(s): E. James Whitehead, Jr., and Yaron Y. Goland

Available at: http://www.cs.ucsc.edu/~ejw/papers/spe-whitehead.pdf

Discussion Leader: Eric Hanson


Friday, 18 February 2005 @ 10:00 AM

Title: Supporting personal collections across digital libraries in spatial hypertext

JCDL '04

Author(s): Frank M. Shipman, Haowei Hsieh, J. Michael Moore, and Anna Zacchi

Available at: http://doi.acm.org/10.1145/996350.996433

Discussion Leader: David Maier


Friday, 11 February 2005 @ 10:00 AM

Title: Lessons Learned Managing a Petabyte

CIDR '05

Author(s): Jacek Becla and Daniel L. Wang

Available at: http://www-db.cs.wisc.edu/cidr/papers/P06.pdf

Discussion Leader: Bill Howe


Friday, 04 February 2005 @ 10:00 AM

Title: Web-Scale Information Extraction in KnowItAll (Preliminary Results)

Author(s): Etzioni et al.

Available at: http://www.cs.washington.edu/homes/weld/papers/www-paper.pdf

Discussion Leader: Susan Price


Friday, 28 January 2005 @ 10:00 AM

Title: The Design of the Borealis Stream Processing Engine

CIDR '05

Author(s): Daniel J. Abadi, Yanif Ahmad, Magdalena Balazinska, Ugur Centintemel, Mitch Cherniack, Jeong-Hyon Hwang, Wolfgang Lindner, Anurag S. Maskey, Alexander Rasin, Esther Ryvkina, Nesime Tatbul, Ying Xing, and Stan Zdonik

Available at: http://nms.lcs.mit.edu/papers/borealis-cidr05.pdf

Discussion Leader: Jin Li


Friday, 21 January 2005 @ 10:00 AM

Title: Lazy Query Evaluation for Active XML

SIGMOD '04

Author(s): Serge Abiteboul, Omar Benjelloun, Bogdan Cautis, Ioana Manolescu, Tova Milo, and Nicoleta Preda

Available at: ftp://ftp.inria.fr/INRIA/Projects/gemo/gemo/GemoReport-315.pdf

Discussion Leader: Vassilis Papadimos


Friday, 10 December 2004 @ 10:00 AM

Title: Remembrance of Streams Past: Overload-Sensitive Management of Archived Streams

VLDB 2004

Author(s): Sirish Chandrasekaran and Michael Franklin

Available at: http://www.vldb.org/conf/2004/RS9P3.PDF

Discussion Leader: Kristin Tufte


Friday, 03 December 2004 @ 10:00 AM

Title: SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases

SIGMOD 2004

Author(s): Mohamed F. Mokbel, Xiaopeng Xiong, and Walid G. Aref

Available at: http://www.cs.purdue.edu/homes/mokbel/SINA-SIGMOD04.pdf

Discussion Leader: Jin Li


Friday, 19 November 2004 @ 10:00 AM

Title: DataMover: Robust Terabyte-Scale Multi-file Replication over Wide-Area Networks

SSDBM 2004

Author(s): A. Sim, J. Gu, A. Shoshani, V. Natarajan

Available at: http://sdm.lbl.gov/~arie/papers/DataMover.SSDBM04.pdf

Discussion Leader: Laura Bright


Friday, 12 November 2004 @ 10:00 AM

Title: Estimating Progress of Execution for SQL Queries

SIGMOD 2004

Author(s): Chaudhuri S., Narasayya V., and Ramamurthy, R.,

Available at: ftp://ftp.research.microsoft.com/users/AutoAdmin/progress.pdf

Discussion Leader: Len Shapiro


Friday, 05 November 2004 @ 10:00 AM

Title: Query Languages and Data Models for Database Sequences and Data Streams

VLDB 2004

Author(s): Yan-Nei Law, Haixun Wang, and Carlo Zaniolo

Available at: http://www.cs.ucla.edu/~zaniolo/papers/vldb04cr.pdf

Discussion Leader: Dave Maier


Friday, 29 October 2004 @ 10:00 AM

Title: XML Packaging

Author(s): W3C XML Packaging Working Group

Available at: http://www.w3.org/XML/2000/07/xml-packaging-charter

Discussion Leader: Eric Hanson


Friday, 29 October 2004 @ 10:00 AM

Title: Related-Resource Discovery for XML

Author(s): Tim Bray

Available at: http://www.textuality.com/xml/why-pkg.html

Discussion Leader: Eric Hanson


Friday, 29 October 2004 @ 10:00 AM

Title: Typekit

Author(s): Eric Hanson

Available at: http://typekit.org/

Discussion Leader: Eric Hanson


Friday, 22 October 2004 @ 10:00 AM

Title: Colorful XML: One Hierarchy Isn't Enough

SIGMOD 2004

Author(s): H. V. Jagadish, Laks V.S. Lakshmanan, Monica Scannapieco, Divesh Srivastava, and Nuwee Wiwatwattana

Available at: http://www.eecs.umich.edu/db/timber/files/mct.pdf

Discussion Leader: Vassilis Papadimos


Friday, 15 October 2004 @ 10:00 AM

Title: Unifying Tables, Objects and Documents

DPCOOL 2003

Author(s): Erik Meijer, Wolfram Schulte, and Gavin Bierman

Available at: http://research.microsoft.com/users/schulte/Papers/UnifyingTablesObjectsAndDocuments(DPCOOL2003).pdf

Discussion Leader: James Terwilliger


Friday, 08 October 2004 @ 10:00 AM

Title: Trio: A System for Integrated Management of Data, Accuracy, and Lineage

Author(s): Jennifer Widom

Available at: http://dbpubs.stanford.edu:8090/pub/2004-40

Discussion Leader: Susan Price


Friday, 20 August 2004 @ 9:30 AM

Title: Efficient dynamic mining of constrained frequent sets

TODS 28(4), 2003

Author(s): Laks V. S. Lakshmanan, Carson Kai-Sang Leung, Raymond T. Ng

Available at: http://doi.acm.org/10.1145/958942.958944

Discussion Leader: Rafael Fernandez


Friday, 13 August 2004 @ 9:30 AM

Title: The Entity-Relationship model -- Toward a unified view of data

TODS 1(1), 1976

Author(s): Peter Pin-Shan Chen

Available at: http://doi.acm.org/10.1145/320434.320440

Discussion Leader: Susan Price


Friday, 06 August 2004 @ 9:30 AM

Title: Efficient Query Reformulation in Peer Data Management Systems

SIGMOD 2004

Author(s): Igor Tatarinov, and Alon Halevy

Available at: http://data.cs.washington.edu/papers/piazza-sigmod2004.pdf

Discussion Leader: Vassilis Papadimos


Friday, 30 July 2004 @ 9:30 AM

Title: GridDB: A Data-Centric Overlay for Scientific Grids

VLDB 2004

Author(s): David Lu, and Michael Franklin

Available at: http://www.cs.berkeley.edu/~dtliu/pubs/griddb_tr.pdf

Discussion Leader: Bill Howe


Friday, 23 July 2004 @ 9:30 AM

Title: Adapting to Source Properties in Processing Data Integration Queries

Author(s): Zachary G. Ives, Alon Halevy, and Daniel S. Weld

Available at: http://www.cis.upenn.edu/~zives/research/adp.pdf

Discussion Leader: Sun Murthy


Friday, 16 July 2004 @ 9:30 AM

Title: Holistic UDAFs at Streaming Speeds

SIGMOD 2004

Author(s): Graham Cormode, Theodore Johnson, Flip Korn, S. Muthukrishnan, Oliver Spatscheck, and Divesh Srivastava

Available at: http://www.research.att.com/~divesh/papers/cjkmss2004-udaf.pdf

Discussion Leader: Jin Li


Friday, 09 July 2004 @ 9:30 AM

Title: A comprehensive XQuery to SQL translation using dynamic interval encoding

SIGMOD 2003

Author(s): David DeHaan, David Toman, Mariano P. Consens, and M. Tamer Oszu

Available at: http://db.uwaterloo.ca/~david/papers-sigmod03.pdf

Discussion Leader: James Terwilliger


Friday, 02 July 2004 @ 9:30 AM

Title: Limiting Disclosure in Hippocratic Databases

VLDB 2004

Author(s): Kristen LeFevre, Rakesh Agrawal, Vuk Ercegovac, Raghu Ramakrishnan, Yirong Xu, and David DeWitt

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Kristin Tufte


Friday, 11 June 2004 @ 9:30 AM

Title: A Denotational Semantics for Continuous Queries over Streams and Relations

Author(s): Arvind Arasu and Jennifer Widom

Available at: http://dbpubs.stanford.edu:8090/pub/2004-19

Discussion Leader: David Maier


Friday, 11 June 2004 @ 9:30 AM

Title: The CQL Continuous Query Language: Semantic Foundations and Query Execution

Author(s): Arvind Arasu, Shivnath Babu, and Jennifer Widom

Available at: http://dbpubs.stanford.edu/pub/2003-67

Discussion Leader: David Maier


Friday, 04 June 2004 @ 9:30 AM

Title: Measurement, Modeling, and Analysis of a Peer-to-Peer File-Sharing Workload

SOSP '03

Author(s): Krishna P. Gummadi, Richard J. Dunn, Stefan Saroiu, Steven D. Gribble, Henry M. Levy, and John Zahorjan

Available at: http://doi.acm.org/10.1145/945445.945475

Discussion Leader: Vassilis Papadimos


Friday, 28 May 2004 @ 9:30 AM

Title: Processing Set Expressions over Continuous Update Streams

SIGMOD '03

Author(s): Sumit Ganguly, Minos Garofalakis, and Rajeev Rastogi

Available at: http://www.bell-labs.com/user/minos/Papers/sigmod03-cam.pdf

Discussion Leader: Kristin Tufte


Friday, 21 May 2004 @ 9:30 AM

Title: Passage Retrieval Based On Language Models

CIKM 02

Author(s): Xiaoyong Liu and W. Bruce Croft

Available at: http://ciir.cs.umass.edu/pubfiles/ir-268.pdf

Discussion Leader: Susan Price


Friday, 14 May 2004 @ 12:15 PM

Title: Optimizing Queries across Diverse Data Sources

VLDB 97

Author(s): Laura M. Haas, Donald Kossmann, Edward L. Wimmers, and Jun Yang

Available at: http://www.ai.mit.edu/people/jimmylin/papers/Haas97.pdf

Discussion Leader: Sun Murthy


Friday, 30 April 2004 @ 9:30 AM

Title: A survey of data mining and knowledge discovery software tools

ACM SIGKDD Explorations, Vol. 1, Issue 1

Author(s): Michael Goebel and Le Gruenwald

Available at: http://doi.acm.org/10.1145/846170.846172

Discussion Leader: Rafael Fernandez


Friday, 23 April 2004 @ 9:30 AM

Title: Ad hoc Query Support for Very Large Simulation Mesh Data: the Metadata Approach.

Author(s): B.S. Lee, R.R. Snapp, R. Musick, and T. Critchlow

Available at: http://www.llnl.gov/casc/people/critchlow/pubs/SBBD-01.pdf

Discussion Leader: Bill Howe


Friday, 16 April 2004 @ 9:30 AM

Title: Efficient Execution of Sliding-Window Queries over Data Streams

Purdue Technical Report

Author(s): Moustafa A. Hammad, Walid G. Aref, Michael J. Franklin, Mohamed F. Mokbel and Ahmed K. Elmagarmid

Available at: http://www.cs.purdue.edu/homes/aref/papers/StreamQueryProcessing-TechReport2003.pdf

Discussion Leader: Jin Li


Friday, 09 April 2004 @ 9:30 AM

Title: Tables As a Paradigm for Querying and Restructuring

PODS 96

Author(s): Marc Gyssens, Laks V. S. Lakshmanan , and Iyer N. Subramanian

Available at: http://portal.acm.org/citation.cfm?id=237688&dl=ACM&coll=portal

Discussion Leader: James Terwilliger


Friday, 19 March 2004 @ 9:30 AM

Title: Operator Scheduling in a Data Stream Manager

Author(s): Don Cranery, Ugur Cetintemel, Alex Rasin, Stan Zdonik, Mitch Cherniack, Mike Stonebraker

Available at: http://www.cs.brown.edu/~dpc/publications/vldb2003.pdf

Discussion Leader: Kristin Tufte


Friday, 12 March 2004 @ 9:30 AM

Title: Optimizing Fixed-Schema XML to SQL Query Translation

VLDB 2002

Author(s): Rajasekar Krishnamurthy, Raghav Kaushik and Jeffrey F. Naughton

Available at: http://www.cs.wisc.edu/niagara/papers/XMLtoSQL-VLDB02.pdf

Discussion Leader: Fang Du


Friday, 27 February 2004 @ 9:30 AM

Title: Information Integration in Schema-Based Peer-to-Peer Networks

CAISE 2003

Author(s): Alexander Loeser, Wolf Siberski, Martin Wolpers and Wolfgang Nejdl

Available at: http://cis.cs.tu-berlin.de/~aloeser/publications/Caise03_submission_Loeser_Nejdl_Wolpers_Siberski.PDF

Discussion Leader: Vassilis Papadimos


Friday, 20 February 2004 @ 9:30 AM

Title: nD-SQL: A multi-dimensional language for interoperability and OLAP

VLDB 1998

Author(s): F. Gingras and L. V. S. Lakshmanan

Available at: http://ftp.cs.concordia.ca/pub/laks/papers/vldb98.ps.gz

Discussion Leader: James Terwilliger


Friday, 13 February 2004 @ 9:30 AM

Title: Optimizing Queries across Diverse Data Sources

VLDB 1997

Author(s): Laura M. Haas, Donald Kossmann, and Edward L. Wimmers

Available at: http://citeseer.nj.nec.com/haas97optimizing.html

Discussion Leader: Sun Murthy


Friday, 06 February 2004 @ 9:30 AM

Title: GODIVA: Lightweight Data Management for Scientific Visualization Applications

ICDE 2004

Author(s): Xiaosong Ma, Marianne Winslett, John Norris, Xiangmin Jiao, and Robert Fiedler

Available at: http://www4.ncsu.edu:8030/~xma/pubs/godiva.pdf

Discussion Leader: Laura Bright


Friday, 30 January 2004 @ 9:30 AM

Title: How Do People Get Back to Information on the Web? How Can They Do It Better?

INTERACT 2003

Author(s): Jones, W., Bruce, H., Dumais, S.

Available at: http://kftf.ischool.washington.edu/Jones,%20Bruce,%20Dumais%20submitted%20for%20review.doc

Discussion Leader: Dave Maier


Friday, 30 January 2004 @ 9:30 AM

Title: Once found, what then?: a study of "keeping" behaviors in personal use of Web information

ASIST 2002

Author(s): Jones, W., Dumais, S., and Bruce, H.

Available at: http://kftf.ischool.washington.edu/Jones%202002%20ASIST.pdf

Discussion Leader: Dave Maier


Friday, 30 January 2004 @ 9:30 AM

Title: A system for personal information retrieval and re-use

SIGIR 2003

Author(s): S. T. Dumais, E. Cutrell, E., J. J. Cadiz, G. Jancke, R. Sarin, and D. C. Robbins

Available at: http://research.microsoft.com/copyright/accept.asp?path=http://research.microsoft.com/~sdumais/SISCore-SIGIR2003-Final.pdf&pub=ACM

Discussion Leader: Dave Maier


Friday, 23 January 2004 @ 9:30 AM

Title: Scheduling for shared window joins over data streams

Author(s): Moustafa A. Hammad, Micheal J. Franklin, Walid G. Arel, and Ahmed K. Elmagarmid

Available at: http://citeseer.nj.nec.com/590715.html

Discussion Leader: Jin Li


Friday, 16 January 2004 @ 9:30 AM

Title: Querying Heterogeneous XML Sources through a Conceptual Schema

ER 2003

Author(s): Sandro Daniel Camillo, Carlos Alberto Heuser, and Ronaldo dos Santos Mello

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre


Friday, 12 December 2003 @ 9:30 AM

Title: Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery

Author(s): S. Chakrabarti, M. van den Berg, and B. Dom

Available at: http://www.cs.berkeley.edu/~soumen/doc/www1999f/pdf/www1999f.pdf

Discussion Leader: Vinit Kalra


Friday, 21 November 2003 @ 9:30 AM

Title: Sorting And Indexing With Partitioned B-Trees

Proceedings of CIDR 2003

Author(s): Goetz Graefe

Available at: http://www-db.cs.wisc.edu/cidr/program/p1.pdf

Discussion Leader: Dave Maier


Friday, 14 November 2003 @ 9:30 AM

Title: Locating Data Sources in Large Distributed Systems

Proceedings of VLDB 2003

Author(s): Leonidas Galanis, Yuan Wang, Shawn R. Jeffery, David J. DeWitt

Available at: http://www.cs.cornell.edu/database/lunch/Fall2003/670_Galanis.pdf

Discussion Leader: Vassilis Papadimos


Friday, 07 November 2003 @ 9:30 AM

Title: A Data Model for Distributed Multiresolution Multisource Scientific Data

Proceedings of ISOFSEM 2002

Author(s): Philip J. Rhodes, R. Daniel Bergeron, and Ted M. Sparr

Available at: http://www.cs.unh.edu/~sdb/rhodes/Tahoe.pdf

Discussion Leader: Bill Howe


Friday, 31 October 2003 @ 9:30 AM

Title: Notions of Indistinguishability for Semantic Web Languages

You can also find a hard copy of the paper in the bin in the CSE Compton reception area

Author(s): Jaap Kamps, Maarten Marx

Available at: http://www.springerlink.com/app/home/content.asp?wasp=9b6a7pvtwn5tub0mhm2u&referrer=contribution&format=2&page=1&pagecount=0

Discussion Leader: Susan Price


Friday, 24 October 2003 @ 9:30 AM

Title: Extending the Role of Digital Library: Computer Support for Creating and Using the Literature

You can also find a hard copy of the paper in the bin in the CSE Compton reception area

Author(s): L. Carr, T. Miles-Board, G. Wills, G. Power, C. Bailey, W. Hall, and S. Grange

Available at: http://eprints.ecs.soton.ac.uk/secure/00007251/01/extending-diglib.pdf

Discussion Leader: Sun Murthy


Friday, 17 October 2003 @ 9:30 AM

Title: Aurora: A New Model and Architecture for Data Stream Management

Author(s): D. Abadi, D. Carney, U. Cetintemel, M. Cherniack, C. Convey, S. Lee, M. Stonebraker, N. Tatbul, and S. Zdonik

Available at: http://www.cs.brown.edu/~ugur/vldbj.pdf

Discussion Leader: Jin Li


Friday, 10 October 2003 @ 9:30 AM

Title: MetaXPath

Proceedings of 2001 Intl. Conf. on Dublin Core and Metadata Applications

Author(s): C. E. Dyreson, M. H. Bohlen, and C. S. Jensen

Available at: http://research.nii.ac.jp/~oyama/dc2001/proceedings/product/paper-03.pdf

Discussion Leader: Fang Du


Friday, 05 September 2003 @ 9:30 AM

Title: Query Processing of Streamed XML Data

11th International Conference on Information and Knowledge Management (CIKM'2002). McLean, VA, November 2002

Author(s): 1. Leonidas Fegaras, David Levine, Sujoe Bose, and Vamsi Chaluvadi

Available at: http://lambda.uta.edu/spapers.html

Discussion Leader: Dave Maier


Friday, 29 August 2003 @ 9:30 AM

Title: E-services: A Look Behind the Curtain

PODS 2003

Author(s): Richard Hull, Michael Benedikt, Vassilis Christophides, Jianwen Su

Available at: http://www.acm.org/sigmod/pods/proc03/online/003-hull.pdf

Discussion Leader: Juliana Freire


Friday, 22 August 2003 @ 9:30 AM

Title: Topical Relevance Relationships. I. Why Topic Matching Fails

Journal of the American Society for Information Science 46(9): 646-653, 1995

Author(s): Rebecca Green

Available at: http://download.interscience.wiley.com/cgi-bin/fulltext?ID=10050231&PLACEBO=IE.pdf&mode=pdf

Discussion Leader: Susan Price


Friday, 15 August 2003 @ 9:30 AM

Title: Querying Structured Text in an XML Database

SIGMOD 2003

Author(s): Shurug Al-Khalifa, Cong Yu, H. V. Jagadish

Available at: http://www.eecs.umich.edu/~congy/work/sigmod03.pdf

Discussion Leader: Sun Murthy


Friday, 08 August 2003 @ 9:30 AM

Title: Window Explained, Window Expressed

Author(s): Sirish Chandrasekaran, Sailesh Krishnamurthy, Samuel Madden, Amol Deshpande, Micheal J. Franklin, Joseph M. Hellerstein, Mehul Shah

Available at: http://www.cs.berkeley.edu/~sirish/research/streaquel.pdf

Discussion Leader: Jenny Li


Friday, 01 August 2003 @ 9:30 AM

Title: Scientific Data Repositories - Designing for a Moving Target

SIGMOD 2003

Author(s): Etzard Stolte, Christoph von Praun, Gustavo Alonso, Thomas Gross

Available at: http://www.inf.ethz.ch/personal/alonso/PAPERS/SIGMOD03.pdf

Discussion Leader: Bill Howe


Friday, 25 July 2003 @ 9:30 AM

Title: Cache-and-Query for Wide Area Sensor Databases

SIGMOD 2003

Author(s): Amol Deshpande, Suman Nath, Phillip B. Gibbons, Srinivasan Seshan

Available at: http://www-2.cs.cmu.edu/~sknath/papers/D+03.pdf

Discussion Leader: Lingzhi Zhang


Friday, 11 July 2003 @ 9:30 AM

Title: Warping Indexes with Envelope Transforms for Query by Humming

SIGMOD 2003

Author(s): Yunyue Zhu, Dennis Shasha

Available at: http://www.cs.nyu.edu/cs/faculty/shasha/papers/humming.pdf

Discussion Leader: Pete Tucker


Friday, 13 June 2003 @ 9:30 AM

Title: Leveraging a Common Representation for Personalized Search and Summarization in a Medical Digital Library

Proc. of JCDL, 2003, Houston, TX

Author(s): Kathleen McKeown, Noemie Elhadad and Vassilis Hatzivassiloglou

Available at: http://www1.cs.columbia.edu/~noemie/papers/jcdl03.ps

Discussion Leader: Susan Price


Friday, 06 June 2003 @ 9:30 AM

Title: The hypercontext framework for adaptive Hypertext

The proceedings of the thirteenth conference on hypertext and hypermedia June 11-15, 2002, College Park, Maryland, USA; Pages 11-20.

Author(s): Christopher D. Staff

Available at: http://doi.acm.org/10.1145/513338.513346

Discussion Leader: Sun Murthy


Friday, 30 May 2003 @ 9:30 AM

Title: Beyond Average: Towards Sophisticated Sensing with Queries

2nd International Workshop on Information Processing in Sensor Networks (IPSN '03)

Author(s): Joseph M. Hellerstein, Wei Hong, Samuel Madden, and Kyle Stanek

Available at: http://www.cs.berkeley.edu/~madden/beyond_average_ipsn.pdf

Discussion Leader: Pete Tucker


Friday, 23 May 2003 @ 9:30 AM

Title: SQL and Management of External Data

SIGMOD Record Volume 30 Number 1 March 2001

Author(s): J. Melton, J. Michels, V. Josifovski, K. Kulkarni, P. Schwarz, K. Zeidenstein

Available at: http://www.acm.org/sigmod/record/issues/0103/JM-Sta.pdf

Discussion Leader: Dave Maier


Friday, 23 May 2003 @ 9:30 AM

Title: SQL/MED - A Status Report

SIGMOD Record Volume 31 Number 3 September 2002

Author(s): J. Melton, J. Michels, V. Josifovski, K. Kulkarni, P. Schwarz

Available at: http://www.acm.org/sigmod/record/issues/0209/jimmelton.pdf

Discussion Leader: Dave Maier


Friday, 09 May 2003 @ 9:30 AM

Title: Comparing Sets of Semantic Relations in Ontologies

Author(s): Eduard Hovy

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre


Friday, 02 May 2003 @ 9:30 AM

Title: Incremental Validation of XML Documents

ICDT 2003, LNCS 2572, p 64-79

Author(s): Yannis Papakonstantinou and Victor Vianu

Available at: http://link.springer-ny.com/link/service/series/0558/papers/2572/25720047.pdf

Discussion Leader: Denilson Barbosa


Friday, 18 April 2003 @ 9:30 AM

Title: Dynamic XML Documents with Distribution and Replication

SIGMOD 2003

Author(s): Serge Abiteboul, Angela Bonifati, Gregory Cobena, Ioana Manolescu, and Tova Milo

Available at: ftp://ftp.inria.fr/INRIA/Projects/verso/gemo/GemoReport-272.ps

Discussion Leader: Vassilis Papadimos


Friday, 11 April 2003 @ 9:30 AM

Title: Crawling the Hidden Web

VLDB 2001

Author(s): Sriram Raghavan and Hector Garcia-Molina

Available at: http://dbpubs.stanford.edu:8090/pub/2001-19

Discussion Leader: Vinit Kalra


Friday, 21 March 2003 @ 9:30 AM

Title: SEQ: A Model for Sequence Databases.

Author(s): Praveen Seshadri, Miron Livny and Raghu Ramakrishnan.

Available at: http://www.cs.cornell.edu/home/praveen/papers/seq.de95.ps.Z

Discussion Leader: Pete Tucker


Friday, 07 March 2003 @ 9:30 AM

Title: Crossing the Structure Chasm

CIDR 2003

Author(s): Alon Halevy, Oren Etzioni, AnHai Doan, Zachary Ives, Jayant Madhavan, Luke McDowell, and Igor Tatarinov

Available at: http://www-db.cs.wisc.edu/cidr/program/p11.pdf

Discussion Leader: Sun Murthy


Friday, 28 February 2003 @ 9:30 AM

Title: Online Aggregation

Author(s): Joseph M. Hellerstein, Peter J. Haas, and Helen J. Wang

Available at: http://control.cs.berkeley.edu/online/online.pdf

Discussion Leader: Jin Li


Friday, 21 February 2003 @ 9:30 AM

Title: Validating streaming XML documents

Author(s): Luc Segoufin, and Victor Vianu

Available at: http://doi.acm.org/10.1145/543613.543622

Discussion Leader: Lingzhi Zhang


Friday, 14 February 2003 @ 9:30 AM

Title: Decomposition - A Strategy for Query Processing

TODS 1(3): 223-241

Author(s): Eugene Wong, and Karel Youssefi

Available at: http://doi.acm.org/10.1145/320473.320479

Discussion Leader: Vassilis Papadimos


Friday, 07 February 2003 @ 9:30 AM

Title: Efficient Exploration of Large Scientific Databases.

VLDB 2002

Author(s): Etzard Stolte, and Gustavo Alonso.

Available at: http://www.inf.ethz.ch/department/IS/iks/publications/files/savldb02.pdf

Discussion Leader: Bill Howe


Friday, 24 January 2003 @ 9:30 AM

Title: A Mapping Schema and Interface for XML Stores

Author(s): Sihem Amer-Yahia

Available at: http://www.research.att.com/~sihem/WIDM02.pdf

Discussion Leader: Fang Du


Friday, 17 January 2003 @ 9:30 AM

Title: The Yin/Yang Web: XML Syntax and RDF Semantics

2002

Author(s): Peter Patel-Schneider and Jerome Simeon

Available at: http://www-db.research.bell-labs.com/user/pfps/papers/yin-yang.pdf

Discussion Leader: Susan Price


Friday, 13 December 2002 @ 9:30 AM

Title: Self-tuning Database Technology and Information Services: from Wishful Thinking to Viable Engineering

Author(s): Gerhard Weikum, Axel Moenkeberg, Christof Hasse, and Peter Zabback

Available at: http://www.vldb.org/conf/2002/S02P02.pdf

Discussion Leader: Vassilis Papadimos


Friday, 06 December 2002 @ 9:30 AM

Title: LEO - DB2's LEarning Optimizer.

VLDB 2001

Author(s): Michael Stillger, Guy M. Lohman, Volker Markl, and Mokhtar Kandil

Available at: http://www.dia.uniroma3.it/~vldbproc/006_019.pdf

Discussion Leader: Juliana Freire


Friday, 22 November 2002 @ 9:30 AM

Title: Learning to Map between Ontologies on the Semantic Web

Author(s): Doan, Madhavan, Domingos, and Halevy

Available at: http://www2002.org/CDROM/refereed/232/index.html

Discussion Leader: Susan Price


Friday, 15 November 2002 @ 9:30 AM

Title: Storing and Querying Ordered XML Using a Relational Database System

SIGMOD 2001

Author(s): Igor Tatarinov, Stratis Viglas, Kevin S. Beyer, Jayavel Shanmugasundaram, Eugene J. Shekita, and Chun Zhang

Available at: http://doi.acm.org/10.1145/564691.564715

Discussion Leader: Lingzhi Zhang


Friday, 08 November 2002 @ 9:30 AM

Title: Translating Web Data

Proceedings of VLDB 2002, Hong Kong SAR, China

Author(s): Lucian Popa, Yannis Velegrakis, Renee J. Miller, Mauricio A. Hernandez, and Ronald Fagin

Available at: http://www.vldb.org/conf/2002/S17P02.pdf

Discussion Leader: Lois Delcambre


Friday, 01 November 2002 @ 9:30 AM

Title: A Transducer-Based XML Query Processor

Author(s): Bertram Ludaescher, Pratik Mukhopadhyay, and Yannis Papakonstantinou

Available at: http://www.cs.ust.hk/vldb2002/VLDB2002-papers/S07P03.pdf

Discussion Leader: Dave Maier


Friday, 25 October 2002 @ 9:30 AM

Title: Pipelining in Multi-Query Optimization

PODS 2001

Author(s): Nilesh N. Dalvi, Sumit K. Sanghai, Prasan Roy, and S. Sudarshan

Available at: http://www.acm.org/sigmod/pods/proc01/online/p49.pdf

Discussion Leader: Bill Howe


Friday, 18 October 2002 @ 9:30 AM

Title: Augmenting Thesaurus Relationships: Possibilities for Retrieval

Author(s): Douglas Tudhope, Harith Alani, and Christopher Jones

Available at: http://jodi.ecs.soton.ac.uk/Articles/v01/i08/Tudhope/

Discussion Leader: Mat Weaver


Friday, 04 October 2002 @ 10:25 AM

Title:

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre or Shawn Bowers


Friday, 13 September 2002 @ 9:30 AM

Title: ExpansionTool: Concept Based Query Expansion and Construction

Information Retrieval 4(3/4), pp. 231-255

Author(s): Kalervo Jarvelin, Jaana Kekalainen, Timo Niemi

Available at: http://www.info.uta.fi/tutkimus/fire/archive/ET3-IR01.pdf

Discussion Leader: Mat Weaver


Friday, 06 September 2002 @ 9:30 AM

Title: How To Query Network Traffic Data Using Data Streams

Author(s): Chuck Cranor, Theodore Johnson and Oliver Spatscheck

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Pete Tucker


Friday, 23 August 2002 @ 9:30 AM

Title: Hyperqueries: Dynamic Distributed Query Processing on the Internet

VLDB 2001

Author(s): Alfons Kemper, Christian Wiesner

Available at: http://www.vldb.org/conf/2001/P551.pdf

Discussion Leader: Vassilis Papadimos


Friday, 16 August 2002 @ 9:30 AM

Title: What can databases do for Peer-to-Peer?

WebDB 2001

Author(s): Steven Gribble, Alon Halevy, Zachary Ives, Maya Rodrig, Dan Suciu

Available at: http://data.cs.washington.edu/papers/p2p.pdf

Discussion Leader: Vassilis Papadimos


Friday, 09 August 2002 @ 9:30 AM

Title: Data-Driven Understanding and Refinement of Schema Mappings

SIGMOD 01

Author(s): Yan, Miller, Haas, Fagin

Available at: http://doi.acm.org/10.1145/375663.375729

Discussion Leader: Shawn Bowers


Friday, 26 July 2002 @ 9:30 AM

Title: Compound Descriptors in Context: A Matching Function for Classifications and Thesauri

In JCDL 2002, pp. 84--93

Author(s): Douglas Tudhope, Ceri Binding, Dorothee Blocks, Daniel Cunliffe

Available at: http://web.glam.ac.uk/schools/soc/research/hypermedia/publications/jcdl02.pdf

Discussion Leader: Lois Delcambre


Friday, 26 July 2002 @ 9:30 AM

Title: A Methodology and System for Preserving Digital Data

In JCDL 2002, pp. 312--319

Author(s): Raymond A. Lorie

Available at: http://www.cse.ogi.edu/dot/dbrg/p69-lorie.pdf

Discussion Leader: Lois Delcambre


Friday, 19 July 2002 @ 9:30 AM

Title: Rules of Thumb in Data Engineering

ICDE 2000

Author(s): Jim Gray, Prashant J. Shenoy

Available at: http://research.microsoft.com/copyright/accept.asp?path=ftp://ftp.research.microsoft.com/pub/tr/tr-99-100.pdf&pub=3

Discussion Leader: Dave Maier


Friday, 12 July 2002 @ 9:30 AM

Title: Annotea: An Open RDF Infrastructure for Shared Web Annotations

WWW10 2001 Hong Kong

Author(s): Jose Kahan, Marja-Riitta Koivunen

Available at: http://www.www10.org/cdrom/papers/pdf/p488.pdf

Discussion Leader: Sun Murthy


Friday, 28 June 2002 @ 9:30 AM

Title: Chimera: A Virtual Data System or Representing, Querying, and Automating Data Derivation

SSDBM 2002

Author(s): Ian Foster

Available at: http://www.griphyn.org/mail_archive/all/pdf00004.pdf

Discussion Leader: Bill Howe


Friday, 14 June 2002 @ 9:30 AM

Title: Holistic Twig Joins: Optimal XML Pattern Matching

In SIGMOD 2002

Author(s): Nicolas Bruno, Divesh Srivastava, Nick Koudas

Available at: http://www.acm.org/sigs/sigmod/sigmod02/eproceedings/papers/Research-Bruno-et-al-1.pdf

Discussion Leader: Vassilis Papadimos


Friday, 24 May 2002 @ 9:30 AM

Title: Mixing querying and navigation in MIX.

In Proc. of the 18th International Conference on Data Engineering, pp. 245--254, 2002.

Author(s): P. Mukhopadhyay and Y. Papakonstantinou

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Shawn Bowers


Friday, 10 May 2002 @ 9:30 AM

Title: Fine Grained Access Control for SOAP E-Services

In: Proceedings of Tenth International World Wide Web Conference (WWW10); 2001; May 1-5; Hong Kong.

Author(s): Damiani E, Vimercati S, Paraboschi S, Samarati P.

Available at: http://www.www10.org/cdrom/papers/pdf/p129.pdf

Discussion Leader: Sun Murthy


Friday, 10 May 2002 @ 9:30 AM

Title: Latency Performance of SOAP Implementations.

To be published in: Proceedings of 2nd International Workshop on Global and Peer-to-Peer on Large Scale Distributed Systems. IEEE International Symposium on Cluster Computing and the Grid; 2002; May; Berlin, Germany.

Author(s): Davis D, Parashar M.

Available at: http://www.caip.rutgers.edu/TASSL/Papers/p2p-p2pws02-soap.pdf

Discussion Leader: Sun Murthy


Friday, 03 May 2002 @ 9:30 AM

Title: Continuously Adaptive Continuous Queries over Streams

In SIGMOD 2002

Author(s): Samuel Madden, Mehul Shah, Joseph M. Hellerstein, Vijayshankar Raman

Available at: http://db.cs.berkeley.edu/papers/sigmod02-cacq.pdf

Discussion Leader: Dave Maier


Friday, 26 April 2002 @ 9:30 AM

Title: Monitoring Streams -- A New Class of Data Management Applications

Brown CS Tech Report TR-CS-02-04

Author(s): Don Carney et al.

Available at: http://www.cs.brown.edu/research/aurora/

Discussion Leader: Pete Tucker


Friday, 19 April 2002 @ 9:30 AM

Title: A Framework of Guidance for Building Good Digital Collections

This is a draft version of an unpublished paper -- please do not redistribute.

Author(s):

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre


Friday, 12 April 2002 @ 9:30 AM

Title: Scientific Workflow Management by Database Management

In SSDBM '98

Author(s): Anastassia Ailamaki, Yannis E. Ioannidis, Miron Livny

Available at: http://www.db.cs.cmu.edu/Pubs/Lib/scistat98/sciwf.pdf

Discussion Leader: Bill Howe


Friday, 22 March 2002 @ 9:30 AM

Title: On bounding-schemas for LDAP directories.

In Proceedings of the 7th International Conference on Extending Database Technology, Konstanz, Germany

Author(s): S. Amer-Yahia, H. Jagadish, L. Lakshmanan, D. Srivastava.

Available at: http://link.springer-ny.com/link/service/series/0558/papers/1777/17770287.pdf

Discussion Leader: Shawn Bowers


Friday, 08 March 2002 @ 9:30 AM

Title: A Formal Ontology of Properties

EKAW 2000

Author(s): Nicola Guarino and Christopher Welty

Available at: http://www.ladseb.pd.cnr.it/infor/Ontology/Papers/EKAW-2000.pdf

Discussion Leader: Mat Weaver


Friday, 15 February 2002 @ 9:30 AM

Title: A Unified Framework for Data Translation over the Web

Both this and the next paper will be discussed in the same session!

Author(s): Ricardo Torlone and Paolo Atzeni

Available at: http://www.dia.uniroma3.it/~torlone/pubs/wise01.ps

Discussion Leader: Lois Delcambre


Friday, 15 February 2002 @ 9:30 AM

Title: VIKI: Spatial Hypertext Supporting Emergent Structure

Author(s): Catherine C. Marshall, Frank M. Shipman III, James H. Coombs

Available at: http://doi.acm.org/10.1145/192757.192759

Discussion Leader: Lois Delcambre


Friday, 08 February 2002 @ 9:30 AM

Title: Fjording the Stream: An Architecture for Queries over Streaming Sensor Data

To appear in ICDE 2002

Author(s): Samuel Madden, Michael J. Franklin

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Pete Tucker


Friday, 01 February 2002 @ 9:30 AM

Title: Java Support for Data-Intensive Systems: Experiences Building the Telegraph Dataflow System

SIGMOD Record 30(4), December 2001

Author(s): M. A. Shah, S. Madden, M. Franklin, and J.M. Hellerstein

Available at: http://www.acm.org/sigmod/record/issues/0112/sys-telegraph.pdf

Discussion Leader: Vassilis Papadimos


Friday, 25 January 2002 @ 9:30 AM

Title: Supporting user-defined activity spaces.

In proceedings of: Conference on Hypertext and Hypermedia 1997; Southampton; UK. Pages 112-123;

Author(s): Wang W, Haake J

Available at: http://journals.ecs.soton.ac.uk/~lac/ht97/pdfs/wang.pdf

Discussion Leader: Sun Murthy


Friday, 18 January 2002 @ 9:30 AM

Title: The Roma personal Metadata service

to appear in Mobile Networks and Applications (MONET) 2002

Author(s): Edward Swierk, Emre Kman, Nathan C. Williams, Takashi Fukushima, Hideki Yoshida, Vince Laviano and Mary Baker

Available at: http://mosquitonet.stanford.edu/publications/Roma-MONET.pdf

Discussion Leader: Bill Howe


Friday, 11 January 2002 @ 9:30 AM

Title: Archiving Scientific Data

Author(s): Buneman, Khanna, Tajima, Tan

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Dave Maier


Friday, 07 December 2001 @ 9:30 AM

Title: Hypertext Interaction Revisited

Author(s): G. Golovchinsky and C. Marshall

Available at: http://www.acm.org/pubs/articles/proceedings/hypertext/336296/p171-golovchinsky/p171-golovchinsky.pdf

Discussion Leader: Sun Murthy


Friday, 30 November 2001 @ 9:30 AM

Title: Surfing Wavelets in Streams: One-Pass Summaries for Approximate Aggregate Queries

VLDB 2001

Author(s): A. Gilbert et al.

Available at: http://www.dia.uniroma3.it/~vldbproc/012_079.pdf

Discussion Leader: Pete Tucker


Friday, 16 November 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Bill Howe


Friday, 16 November 2001 @ 9:30 AM

Title: Querying the Physical World

Author(s): Bonnet, Gehrke, Seshadri

Available at: http://citeseer.nj.nec.com/308644.html

Discussion Leader: Bill Howe


Friday, 02 November 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Vassilis Papadimos


Friday, 26 October 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Dave Maier


Friday, 19 October 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Lois Delcambre


Friday, 12 October 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Mat Weaver


Friday, 05 October 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Shawn Bowers


Friday, 31 August 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Dave


Friday, 24 August 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Mat


Friday, 17 August 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Sun


Friday, 10 August 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Lois


Friday, 03 August 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Everyone


Friday, 27 July 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Pete Tucker


Friday, 20 July 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: David Maier


Friday, 13 July 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Vassilis Papadimos


Friday, 01 June 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Pete Tucker


Friday, 25 May 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Mathew Weaver


Friday, 11 May 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Foula Vagena


Friday, 04 May 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Dave Maier


Friday, 27 April 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Shawn Bowers


Friday, 13 April 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Vasileios Papadimos


Friday, 02 March 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Foula Vagena


Friday, 23 February 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Lois Delcambre


Friday, 16 February 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Pete Tucker


Friday, 09 February 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Shawn Bowers


Friday, 02 February 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Mathew Weaver


Friday, 26 January 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: David Maier


Friday, 12 January 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at: http://www.cs.washington.edu/homes/etzioni/papers/be_www2005.pdf

Discussion Leader: Kevin Beck


Wednesday, 31 December 1969 @ 4:00 PM

Title: Privacy-preserving record linkage using Bloom filters

BMC Medical Informatics and Decision Making 2009, 9:41

Author(s): Rainer Schnell, Tobias Bachteler and Jörg Reiher

Available at: http://www.biomedcentral.com/content/pdf/1472-6947-9-41.pdf

Discussion Leader: Abdussalam Alawini


Wednesday, 31 December 1969 @ 4:00 PM

Title: Photon: Fault-tolerant and Scalable Joining of Continuous Data Streams

SIGMOD 2013

Author(s): Rajagopal Ananthanarayanan, Venkatesh Basker, Sumit Das, Ashish Gupta, Haifeng Jiang, Tianhao Qiu, Alexey Reznichenko, Deomid Ryabkov, Manpreet Singh, Shivakumar Venkataraman

Available at: http://dl.acm.org/citation.cfm?id=2465272

Discussion Leader: Chris


Wednesday, 31 December 1969 @ 4:00 PM

Title: Coordination Avoidance in Database Systems

VLDB 2015 - Proceedings of the VLDB Endowment, Vol. 8, No. 3

Author(s): Peter Bailis, Alan Fekete, Michael J. Franklin, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica

Available at: http://www.bailis.org/papers/ca-vldb2015.pdf

Discussion Leader: Jeremiah Peschka


Wednesday, 31 December 1969 @ 4:00 PM

Title: Customized Random Walk for Generating Wikipedia Article Recommendations

Not published

Author(s): Jocelyn Hickcox and Chris Min

Available at: https://pdfs.semanticscholar.org/cf43/e1d4c94f85f5de4e36b9ab777595d0253b56.pdf

Discussion Leader: Hisham


Wednesday, 31 December 1969 @ 4:00 PM

Title: Rough sets and intelligent data analysis

Information Sciences 147 (2002) 1–12

Author(s): Zdzisław Pawlak

Available at: http://bcpw.bg.pw.edu.pl/Content/1932/infSci2002.pdf

Discussion Leader: Basem


Wednesday, 31 December 1969 @ 4:00 PM

Title: Seven Databases in Seven Weeks

CMU Seminar series

Author(s): CMU Seminar series

Available at: http://db.cs.cmu.edu/seminar2014/

Discussion Leader: Dave


Wednesday, 31 December 1969 @ 4:00 PM

Title: Plenario: An Open Data Discovery and Exploration Platform for Urban Science

IEEE Data Engineering Bulletin, Dec 2014

Author(s): C. Cattlett et al.

Available at: http://sites.computer.org/debull/A14dec/p27.pdf

Discussion Leader: Dave


Wednesday, 31 December 1969 @ 4:00 PM

Title: The Snowflake Elastic Data Warehouse

SIGMOD '16 Proceedings of the 2016 International Conference on Management of Data Pages 215-226 ACM

Author(s): Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Vadim Antonov, Artin Avanes, Jon Bock, Jonathan Claybaugh, Daniel Engovatov, Martin Hentschel, Jiansheng Huang, Allison W. Lee, Ashish Motivala, Abdul Q. Munir, Steven Pelley, Peter Povinec, Greg Rahn, S

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Shree