Database Reading Group

Papers previously discussed (most recent first):

Monday, 18 November 2024 @ 11:00 AM

Title: Cleaning Denial Constraint Violations through Relaxation


Author(s): Stella Giannakopoulou, Manos Karpathiotakis and Anastasia Ailamaki

Available at:

Discussion Leader: Nicholas Morales

Monday, 28 October 2024 @ 11:00 AM

Title: Can Increasing the Hit Ratio Hurt Cache Throughput?

arXiv 2024

Author(s): Ziyue Qiu, Juncheng Yang, Mor Harchol-Balter

Available at:

Discussion Leader: David Maier

Monday, 14 October 2024 @ 11:00 AM

Title: Disclosure-compliant Query Answering


Author(s): Rudi Poepsel-Lemaitre, Kaustubh Beedkar and Volker Markl

Available at:

Discussion Leader: Dr. Primal Pappachan

Friday, 31 May 2024 @ 10:30 AM

Title: POLAR: Adaptive and Non-invasive Join Order Selection via Plans of Least Resistance

Proceedings of the VLDB Endowment, 2024

Author(s): David Justen, Daniel Ritter, Campbell Fraser, Andrew Lamb, Allison Lee, Thomas Bodner, Mhd Yamen Haddad, Steffen Zeuch, Volker Markl, and Matthias Boehm

Available at:

Discussion Leader: Dylan Conklin

Friday, 17 May 2024 @ 10:30 AM

Title: Fast Approximate Denial Constraint Discovery

Proceedings of the VLDB Endowment, Volume 16, Issue 2

Author(s): Renjie Xiao (Fudan University), Zijing Tan (Fudan University), Haojin Wang (Fudan University), Shuai Ma (Beihang University)

Available at:

Discussion Leader: Nicholas Morales

Friday, 03 May 2024 @ 10:30 AM

Title: Optimizing Disjunctive Queries with Tagged Execution

arXiv 2024

Author(s): Albert Kim (MIT), Samuel Madden (MIT)

Available at:

Discussion Leader: Anadi Shakya

Friday, 26 April 2024 @ 10:30 AM

Title: Enabling Personal Consent in Databases

VLDB 2022

Author(s): George Konstantinidis (University of Southampton, UK), Jet Holt (University of Southampton, UK), Adriane Chapman (University of Southampton, UK)

Available at:

Discussion Leader: Dr. Primal Pappachan

Friday, 19 April 2024 @ 10:30 AM

Title: DARQ Matter Binds Everything: Performant and Composable Cloud Programming via Resilient Steps


Author(s): Tianyu Li (Massachusetts Institute of Technology); Badrish Chandramouli (Microsoft Research); Sebastian C Burckhardt (Microsoft Research); Samuel Madden (MIT)

Available at:

Discussion Leader: Prof. Dave Maier

Friday, 07 June 2019 @ 10:00 AM

Title: Navigating the Data Lake with Datamaran: Automatically Extracting Structure from Log Datasets

SIGMOD '18

Author(s): Yihan Gao, Silu Huang, Aditya Parameswaran

Available at:

Discussion Leader: Chris

Friday, 31 May 2019 @ 10:00 AM

Title: Snorkel: Rapid Training Data Creation with Weak Supervision

VLDB 2018

Author(s): Alexander Ratner, Stephen H. Bach, Henry Ehrenberg, Jason Fries, Sen Wu, Christopher Re

Available at:

Discussion Leader: Chris

Friday, 24 May 2019 @ 10:00 AM

Title: Gorilla: A Fast, Scalable, In-Memory Time Series Database

VLDB 2015

Author(s): Tuomas Pelkonen, Scott Franklin, Justin Teller, Paul Cavallaro, Qi Huang, Justin Meza, Kaushik Veeraraghavan

Available at:

Discussion Leader: Basem

Friday, 17 May 2019 @ 10:00 AM

Title: Compile-Time Optimization of Embedded Data-Intensive Query Languages

IEEE BigData Congress, San Francisco, July 2018

Author(s): Leonidas Fegaras and Md Hasanuzzaman Noor

Available at:

Discussion Leader: Leonidas Fegaras

Friday, 10 May 2019 @ 10:00 AM

Title: Pixie: A System for Recommending 3+ Billion Items to 200+ Million Users in Real-Time

WWW 2018

Author(s): Chantat Eksombatchai, Pranav Jindal, Jerry Zitao Liu, Yuchen Liu, Rahul Sharma, Charles Sugnet, Mark Ulrich, Jure Leskovec

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Hisham

Friday, 26 April 2019 @ 10:00 AM

Title: Software Platforms for Smart Cities: Concepts, Requirements, Challenges, and a Unified Reference Architecture

ACM Comput. Surv. 50, 6, Article 78 (November 2017)

Author(s): Eduardo Felipe Zambom Santana, Ana Paula Chaves, Marco Aurelio Gerosa, Fabio Kon, and Dejan S. Milojicic

Available at:

Discussion Leader: Dave

Friday, 19 April 2019 @ 10:00 AM

Title: Software Platforms for Smart Cities: Concepts, Requirements, Challenges, and a Unified Reference Architecture

ACM Comput. Surv. 50, 6, Article 78 (November 2017)

Author(s): Eduardo Felipe Zambom Santana, Ana Paula Chaves, Marco Aurelio Gerosa, Fabio Kon, and Dejan S. Milojicic

Available at:

Discussion Leader: Dave

Friday, 15 March 2019 @ 10:00 AM

Title: ModelarDB: Modular Model-Based Time Series Management with Spark and Cassandra

VLDB 2018

Author(s): Soren Kejser Jensen, Torben Bach Pedersen, Christian Thomsen

Available at:

Discussion Leader: Basem

Friday, 08 March 2019 @ 10:00 AM

Title: Coconut: A Scalable Bottom-Up Approach for Building Data Series Indexes

VLDB 2018

Author(s): Haridimos Kondylakis, Niv Dayan, Kostas Zoumpatianos, Themis Palpanas

Available at:

Discussion Leader: Chris

Friday, 01 March 2019 @ 10:00 AM

Title: Northstar: An Interactive Data Science System

VLDB 2018

Author(s): Tim Kraska

Available at:

Discussion Leader: Hisham

Friday, 22 February 2019 @ 10:00 AM

Title: Ten Years of WebTables

VLDB 2018

Author(s): Michael Cafarella, Alon Halevy, Hongrae Lee, Jayant Madhavan, Cong Yu, Daisy Zhe Wang, Eugene Wu

Available at:

Discussion Leader: Basem

Friday, 15 February 2019 @ 10:00 AM

Title: Everything You Always Wanted to Know About Compiled and Vectorized Queries But Were Afraid to Ask

VLDB 2018

Author(s): Timo Kersten, Viktor Leis, Alfons Kemper, Thomas Neumann, Andrew Pavlo, Peter Boncz

Available at:

Discussion Leader: Dave

Friday, 01 February 2019 @ 10:00 AM

Title: User Interests Identification on Twitter Using a Hierarchical Knowledge Base

European Semantic Web Conference 2014

Author(s): Pavan Kapanipathi, Prateek Jain, Chitra Venkataramani, and Amit Sheth

Available at:

Discussion Leader: Hisham

Friday, 25 January 2019 @ 10:00 AM

Title: How Good Are Spatial Analytics Systems?

VLDB 2018

Author(s): Varun Pandey, Andreas Kipf, Thomas Neumann, Alfons Kemper

Available at:

Discussion Leader: Chris

Friday, 18 January 2019 @ 10:00 AM

Title: Open Data Integration

VLDB 2018

Author(s): Renee J. Miller

Available at:

Discussion Leader: Dave

Friday, 30 November 2018 @ 12:00 AM

Title: D3: Data-Driven Documents


Author(s): Michael Bostock, Vadim Ogievetsky, Jeffrey Heer

Available at:

Discussion Leader: Basem

Friday, 16 November 2018 @ 10:00 AM

Title: ObjectRank: Authority-Based Keyword Search in Databases

VLDB 2004

Author(s): Andrey Balmin, Vagelis Hristidis, and Yannis Papakonstantinou

Available at:

Discussion Leader: Hisham

Friday, 09 November 2018 @ 10:00 AM

Title: Extending the Database Relational Model to Capture More Meaning

ACM Transactions on Database Systems

Author(s): E. F. Codd

Available at:

Discussion Leader: Paul

Friday, 02 November 2018 @ 10:00 AM

Title: Selecting representative and diverse spatio-textual posts over sliding windows

SSDBM '18 Proceedings of the 30th International Conference on Scientific and Statistical Database Management

Author(s): Dimitris Sacharidis, Paras Mehta, Dimitrios Skoutas, Kostas Patroumpas, and Agnès Voisard

Available at:

Discussion Leader: Dave

Friday, 26 October 2018 @ 10:00 AM

Title: An overview of deterministic database systems

Communications of the ACM Volume 61 Issue 9, September 2018 Pages 78-88

Author(s): Daniel J. Abadi, Jose M. Faleiro

Available at:

Discussion Leader: Shree

Friday, 19 October 2018 @ 10:00 AM

Title: Efficient Denial Constraint Discovery with Hydra

VLDB '18

Author(s): Tobias Bleifuss, Sebastian Kruse, Felix Naumann

Available at:

Discussion Leader: Chris

Friday, 12 October 2018 @ 10:00 AM

Title: Numerically stable parallel computation of (co-)variance

SSDBM '18 Proceedings of the 30th International Conference on Scientific and Statistical Database Management

Author(s): Schubert, Gertz

Available at:

Discussion Leader: Dave

Friday, 05 October 2018 @ 10:00 AM

Title: GeoSparkViz: a scalable geospatial data visualization framework in the apache spark ecosystem

SSDBM '18 Proceedings of the 30th International Conference on Scientific and Statistical Database Management

Author(s): Yu, Zhang, Sarwat

Available at:

Discussion Leader: Chris

Friday, 08 June 2018 @ 10:00 AM

Title: GraphX: Graph Processing in a Distributed Dataflow Framework

2014 Proceedings of the 11th USENIX Symposium on Operating Systems Design and Implementation

Author(s): Joseph E. Gonzalez, Reynold S. Xin, Ankur Dave, Daniel Crankshaw, Michael J. Franklin, and Ion Stoica

Available at:

Discussion Leader: Basem Elazzabi

Friday, 01 June 2018 @ 10:00 AM

Title: Impatience is a Virtue: Revisiting Disorder in High-Performance Log Analytics

ICDE 2018

Author(s): Badrish Chandramouli, Jonathan Goldstein, Yinan Li

Available at:

Discussion Leader: Dave Maier

Friday, 25 May 2018 @ 10:00 AM

Title: Incomplete data: what went wrong, and how to fix it


Author(s): Leonid Libkin

Available at:

Discussion Leader: Paul Jungwirth

Friday, 18 May 2018 @ 10:00 AM

Title: Learning topic models -- provably and efficiently

ICML '13

Author(s): Sanjeev Arora, Rong Ge, Yoni Halpern, David Mimno, Ankur Moitra, David Sontag, Yichen Wu, and Michael Zhu

Available at:

Discussion Leader: Shreemoyee Sarkar

Friday, 11 May 2018 @ 10:00 AM

Title: ArrayBridge: Interweaving declarative array processing in SciDB with imperative HDF5-based programs

ICDE 2018

Author(s): Haoyuan Xin, Sofoklis Floratos, Spyros Blanas, Suren Byna, Prabhat, Kesheng Wu, Paul Brown

Available at:

Discussion Leader: Dave Maier

Friday, 04 May 2018 @ 10:00 AM

Title: Resilient distributed datasets: a fault-tolerant abstraction for in-memory cluster computing

NSDI '12: Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation

Author(s): Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael J. Franklin, Scott Shenker, Ion Stoica

Available at:

Discussion Leader: Basem Elazzabi

Friday, 27 April 2018 @ 10:00 AM

Title: Using Ontologies for Semantic Data Integration

A Comprehensive Guide Through the Italian Database Research Over the Last 25 Years

Author(s): Giuseppe De Giacomo, Domenico Lembo, Maurizio Lenzerini, Antonella Poggi, Riccardo Rosati

Available at:

Discussion Leader: Chris Giossi

Friday, 20 April 2018 @ 10:00 AM

Title: Using Encyclopedic Knowledge for Automatic Topic Identification

Computational Natural Language Learning, 2009

Author(s): Kino Coursey, Rada Mihalcea, and William Moen

Available at:

Discussion Leader: Hisham Benotman

Friday, 16 March 2018 @ 10:00 AM

Title: Mostly-Optimistic Concurrency Control for Highly Contended Dynamic Workloads on a Thousand Cores

VLDB 2017

Author(s): Tianzheng Wang, Hideaki Kimura

Available at:

Discussion Leader: Basem

Friday, 09 March 2018 @ 10:00 AM

Title: Towards an IoT Big Data Analytics Framework: Smart Buildings Systems

IEEE 14th International Conference on Smart City

Author(s): Muhammad Rizwan Bashir ; Asif Qumer Gill

Available at:

Discussion Leader: Dave

Friday, 02 March 2018 @ 10:00 AM

Title: Analysis of the HTTPS certificate ecosystem

IMC '13: Proceedings of the 2013 Internet Measurement Conference, Pages 291-304

Author(s): Zakir Durumeric, James Kasten, Michael Bailey and J. Alex Halderman

Available at:

Discussion Leader: Shree

Friday, 23 February 2018 @ 10:00 AM

Title: Wikipedia as an Ontology for Describing Documents


Author(s): Zareen Saba Syed, Tim Finin and Anupam Joshi

Available at:

Discussion Leader: Hisham

Friday, 16 February 2018 @ 10:00 AM

Title: The End of a Myth: Distributed Transactions Can Scale

VLDB 2017

Author(s): Erfan Zamanian, Carsten Binnig, Tim Kraska, Tim Harris

Available at:

Discussion Leader: Chris

Friday, 09 February 2018 @ 10:00 AM

Title: Time Series Data Cleaning: From Anomaly Detection to Anomaly Repairing

VLDB '17

Author(s): Aoqian Zhang, Shaoxu Song, Jianmin Wang, Philip Yu

Available at:

Discussion Leader: Chris

Friday, 02 February 2018 @ 10:00 AM

Title: Effective Indexing for Approximate Constrained Shortest Path Queries on Large Road Networks

VLDB 2017

Author(s): Sibo Wang, Xiaokui Xiao, Yin Yang, Wenqing Lin

Available at:

Discussion Leader: Basem

Friday, 26 January 2018 @ 10:00 AM

Title: Temporal Alignment


Author(s): Antos Dignös, Michael Böhlen, Johann Gamper

Available at:

Discussion Leader: Paul Jungwirth

Friday, 19 January 2018 @ 12:00 AM

Title: Information Quality Assessment for Facility Management

Journal of Advanced Engineering Informatics, 2017

Author(s): Puyan A. Zadeh, Guan Wang, Hasan Burak-Cavka, Sheryl Staub-French, Rachel Pottinger

Available at:

Discussion Leader: Dave

Friday, 01 December 2017 @ 10:00 AM

Title: I've Seen "Enough": Incrementally Improving Visualizations to Support Rapid Decision Making

VLDB 2017

Author(s): Sajjadur Rahman, Maryam Aliakbarpour, Ha Kyung Kong, Eric Blais, Karrie Karahalios, Aditya Parameswaran, Ronitt Rubinfeld

Available at:

Discussion Leader: Basem

Friday, 17 November 2017 @ 10:00 AM

Title: Striim: A streaming analytics platform for real-time business decisions

BIRTE 2017

Author(s): Alok Pareek, Bhushan Khaladkar, Rajkumar Sen, Basar Onat, Vijay Nadimpalli, Manish Agarwal, and Nicholas Keene

Available at:

Discussion Leader: Dave

Friday, 03 November 2017 @ 10:00 AM

Title: Using Topic Models to Assess Document Relevance in Exploratory Search User Studies

CHIIR 2017

Author(s): Alan Medlar, Dorota Głowacka

Available at:

Discussion Leader: Hisham Benotman

Friday, 27 October 2017 @ 10:00 AM

Title: MILC: Inverted List Compression in Memory

VLDB 2017

Author(s): Jianguo Wang, Chunbin Lin, Ruining He, Moojin Chae, Yannis Papakonstantinou, Steven Swanson

Available at:

Discussion Leader: Basem

Friday, 20 October 2017 @ 10:00 AM

Title: Context-Based Event Processing Systems

SpringerLink: Studies in Computational Intelligence, vol 347. Springer, Berlin, Heidelberg

Author(s): Opher Etzion, Yonit Magid, Ella Rabinovich, Inna Skarbovsky, and Nir Zolotorevsky

Available at:

Discussion Leader: Hong Quach

Friday, 13 October 2017 @ 10:00 AM

Title: NG-DBSCAN: Scalable Density-Based Clustering for Arbitrary Data

VLDB 2017

Author(s): Alessandro Lulli, Matteo Dell'Amico, Pietro Michiardi, Laura Ricci

Available at:

Discussion Leader: Chris

Friday, 06 October 2017 @ 10:00 AM

Title: NoScope: Optimizing Neural Network Queries over Video at Scale

VLDB 2017

Author(s): Daniel Kang, John Emmons, Firas Abuzaid, Peter Bailis, Matei Zaharia

Available at:

Discussion Leader: Dave

Friday, 09 June 2017 @ 10:00 AM

Title: Smart Personalized Routing For Smart Cities

ICDE 2017

Author(s): Abdeltawab M. Hendawi, Aqeel Rustum, Amr A. Ahmadain, David Hazel, Ankur Teredesai, Dev Oliver, Mohamed Ali, John A. Stankovic

Available at:

Discussion Leader: Dave Maier

Friday, 02 June 2017 @ 10:00 AM

Title: Fast Queries Over Heterogeneous Data Through Engine Customization

VLDB 2016

Author(s): Manos Karpathiotakis, Ioannis Alagiannis, Anastasia Ailamaki

Available at:

Discussion Leader: Chris

Friday, 19 May 2017 @ 10:00 AM

Title: Efficient Processing of Window Functions in Analytical SQL Queries

VLDB 2015

Author(s): Viktor Leis, Alfons Kemper, Kan Kundhikanjana, and Thomas Neumann

Available at:

Discussion Leader: Hong Quach

Friday, 12 May 2017 @ 10:00 AM

Title: Reactive Vega: A Streaming Dataflow Architecture for Declarative Interactive Visualization

IEEE 2016

Author(s): Arvind Satyanarayan, Ryan Russell, Jane Hoffswell, and Jeffrey Heer

Available at:

Discussion Leader: Basem

Friday, 05 May 2017 @ 10:00 AM

Title: Twitter Heron: Stream Processing at Scale

SIGMOD '15: Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data

Author(s): Sanjeev Kulkarni, Nikunj Bhagat, Maosong Fu, Vikas Kedigehalli, Christopher Kellogg, Sailesh Mittal, Jignesh M. Patel, Karthik Ramasamy, Siddarth Taneja

Available at:

Discussion Leader: Shreemoyee Sarkar

Friday, 28 April 2017 @ 10:00 AM

Title: Spatial Online Sampling and Aggregation

VLDB 2016

Author(s): Lu Wang, Robert Christensen, Feifei Li, Ke Yi

Available at:

Discussion Leader: Chris

Friday, 21 April 2017 @ 10:00 AM

Title: Studying the Wikipedia Hyperlink Graph for Relatedness and Disambiguation

arXiv 2015

Author(s): Eneko Agirre, Ander Barrena, Aitor Soroa

Available at:

Discussion Leader: Hisham

Friday, 14 April 2017 @ 10:00 AM

Title: SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning

CIDR 2017

Author(s): Tarek Elgamal, Shangyu Luo, Matthias Boehm, Alexandre V. Evfimievski, Shirish Tatikonda, Berthold Reinwald, Prithviraj Sen

Available at:

Discussion Leader: Dave Maier

Friday, 17 March 2017 @ 10:00 AM

Title: SnappyData: A Unified Cluster for Streaming, Transactions and Interactive Analytics

CIDR 2017

Author(s): Barzan Mozafari, Jags Ramnarayan, Sudhir Menon, Yogesh Mahajan, Soubhik Chakraborty, Hemant Bhanawat, Kishor Bachhav

Available at:

Discussion Leader: Dave Maier

Friday, 10 March 2017 @ 10:00 AM

Title: Reducing the storage overhead of main memory OLTP databases with hybrid indexes

SIGMOD '16, June 26-July 01, 2016, San Francisco, CA, USA

Author(s): Huanchen Zhang, David G. Andersen, Andrew Pavlo, Michael Kaminsky, Lin Ma, Rui Shen

Available at:

Discussion Leader: Shreemoyee Sarkar

Friday, 03 March 2017 @ 10:00 AM

Title: Facetedpedia: Dynamic Generation of Query-Dependent Faceted Interfaces for Wikipedia

World Wide Web 2010

Author(s): Chengkai Li, Ning Yan, Senjuti B. Roy, Lekhendro Lisham, Gautam Das

Available at:

Discussion Leader: Hisham Benotman

Friday, 24 February 2017 @ 10:00 AM

Title: The Myria Big Data Management and Analytics System and Cloud Services

CIDR 2017

Author(s): Jingjing Wang, Tobin Baker, Magda Balazinska, Daniel Halperin, Brandon Hayes, Bill Howe, Dylan Hutchinson, Shrainik Jain, Ryan Maas, Parmita Mehta, Dominik Moritz, Brandon Myers, Jennifer Ortiz, Dan Suciu, Andrew Whittaker, Shengliang Xu

Available at:

Discussion Leader: Dave Maier

Friday, 17 February 2017 @ 10:00 AM

Title: LEOPARD: Lightweight Edge-Oriented Partitioning and Replication for Dynamic Graphs

VLDB 2016

Author(s): Jiewen Huang, Daniel Abadi

Available at:

Discussion Leader: Basem Elazzabi

Friday, 10 February 2017 @ 10:00 AM

Title: Self-driving database management systems

CIDR 2017

Author(s): Pavlo et al.

Available at:

Discussion Leader: Jeremiah Peschka

Friday, 03 February 2017 @ 10:00 AM

Title: SnappyData: A Unified Cluster for Streaming, Transactions and Interactive Analytics

CIDR 2017

Author(s): Barzan Mozafari, Jags Ramnarayan, Sudhir Menon, Yogesh Mahajan, Soubhik Chakraborty, Hemant Bhanawat, Kishor Bachhav

Available at:

Discussion Leader: Dave Maier

Friday, 27 January 2017 @ 10:00 AM

Title: ULDBs: Databases with Uncertainty and Lineage

VLDB '06

Author(s): Omar Benjelloun, Anish Das Sarma, Alon Halevy, Jennifer Widom

Available at:

Discussion Leader: Chris Giossi

Friday, 02 December 2016 @ 10:00 AM

Title: Coordination Avoidance in Database Systems

VLDB 2015 - Proceedings of the VLDB Endowment, Vol. 8, No. 3

Author(s): Peter Bailis, Alan Fekete, Michael J. Franklin, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica

Available at:

Discussion Leader: Jeremiah Peschka

Friday, 18 November 2016 @ 10:00 AM

Title: The Snowflake Elastic Data Warehouse

SIGMOD '16 Proceedings of the 2016 International Conference on Management of Data Pages 215-226 ACM

Author(s): Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Vadim Antonov, Artin Avanes, Jon Bock, Jonathan Claybaugh, Daniel Engovatov, Martin Hentschel, Jiansheng Huang, Allison W. Lee, Ashish Motivala, Abdul Q. Munir, Steven Pelley, Peter Povinec, Greg Rahn, S

Available at:

Discussion Leader: Shree

Friday, 04 November 2016 @ 10:00 AM

Title: Plenario: An Open Data Discovery and Exploration Platform for Urban Science

IEEE Data Engineering Bulletin, Dec 2014

Author(s): C. Catlett et al.

Available at:

Discussion Leader: Dave

Friday, 28 October 2016 @ 10:00 AM

Title: Rough sets and intelligent data analysis

Information Sciences 147 (2002) 1-12

Author(s): Zdzisław Pawlak

Available at:

Discussion Leader: Basem

Friday, 21 October 2016 @ 10:00 AM

Title: Seven Databases in Seven Weeks

CMU Seminar series

Author(s): CMU Seminar series

Available at:

Discussion Leader: Dave

Friday, 14 October 2016 @ 10:00 AM

Title: Customized Random Walk for Generating Wikipedia Article Recommendations

Not published

Author(s): Jocelyn Hickcox and Chris Min

Available at:

Discussion Leader: Hisham

Friday, 07 October 2016 @ 10:00 AM

Title: Photon: Fault-tolerant and Scalable Joining of Continuous Data Streams


Author(s): Rajagopal Ananthanarayanan, Venkatesh Basker, Sumit Das, Ashish Gupta, Haifeng Jiang, Tianhao Qiu, Alexey Reznichenko, Deomid Ryabkov, Manpreet Singh, Shivakumar Venkataraman

Available at:

Discussion Leader: Chris

Friday, 27 May 2016 @ 10:00 AM

Title: An Optimization Framework for Map-Reduce Queries

EDBT 2012

Author(s): Leonidas Fegaras, Chengkai Li, Upa Gupta

Available at:

Discussion Leader: David Maier

Friday, 20 May 2016 @ 10:00 AM

Title: Assessing Learning Outcomes in Web Search: A Comparison of Tasks and Query Strategies

CHIIR 2016

Author(s): Kevyn Collins-Thompson, Soo Young Rieh, Carl C. Haynes, Rohail Syed

Available at:

Discussion Leader: Hisham Benotman

Friday, 06 May 2016 @ 10:00 AM

Title: Data Properties Using Abstraction to Enhance the Use of Data in Decision Making

RPE Talk

Author(s): Basem Elazzabi

Available at: Attached

Discussion Leader: Basem Elazzabi

Friday, 29 April 2016 @ 10:00 AM

Title: Equality Saturation: a New Approach to Optimization

POPL 2009

Author(s): Ross Tate, Michael Stepp, Zachary Tatlock, Sorin Lerner

Available at:

Discussion Leader: David Maier

Friday, 22 April 2016 @ 10:00 AM

Title: Can We Analyze Big Data Inside a DBMS?

DOLAP 2013

Author(s): Carlos Ordonez

Available at:

Discussion Leader: Chris

Friday, 15 April 2016 @ 10:00 AM

Title: SociaLite: An Efficient Graph Query Language Based on Datalog

TKDE July-Aug 2015

Author(s): J. Seo (Dept. of Comput. Sci., Stanford Univ., Stanford, CA, USA), S. Guo, M. S. Lam

Available at:

Discussion Leader: David Maier

Friday, 08 April 2016 @ 10:00 AM

Title: A Rule-Based Citation System for Structured and Evolving Datasets

IEEE Data Eng. 2010

Author(s): Peter Buneman & Gianmaria Silvello

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 04 March 2016 @ 10:00 AM

Title: Querying and Managing Provenance through User Views in Scientific Workflows

Data Engineering, 2008.

Author(s): Biton, O.; Cohen-Boulakia, S.; Davidson, S.B.; Hara, C.S.

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 26 February 2016 @ 10:00 AM

Title: The Dataflow Model: A Practical Approach to Balancing Correctness, Latency, and Cost in Massive-Scale, Unbounded, Out-of-Order Data Processing

VLDB 2015

Author(s): Tyler Akidau, Robert Bradshaw, Craig Chambers, Slava Chernyak, Rafael J. Fernandez-Moctezuma, Reuven Lax, Sam McVeety, Daniel Mills, Frances Perry, Eric Schmidt, Sam Whittle

Available at:

Discussion Leader: David Maier

Friday, 19 February 2016 @ 10:00 AM

Title: Entity ranking in Wikipedia

SAC '08 Proceedings of the 2008 ACM symposium on Applied computing

Author(s): Anne-Marie Vercoustre, James A. Thom, Jovan Pehcevski

Available at:

Discussion Leader: Hisham Benotman

Friday, 12 February 2016 @ 10:00 AM

Title: Orleans: Distributed Virtual Actors for Programmability and Scalability


Author(s): Philip A. Bernstein, Sergey Bykov, Alan Geller, Gabriel Kliot, and Jorgen Thelin

Available at:

Discussion Leader: David Maier

Friday, 05 February 2016 @ 10:00 AM

Title: An Architecture for Compiling UDF-centric Workflows

VLDB 2015

Author(s): Andrew Crotty, Alex Galakatos, Kayhan Dursun, Tim Kraska, Carsten Binnig, Ugur Cetintemel, Stan Zdonik

Available at:

Discussion Leader: Chris

Friday, 22 January 2016 @ 10:00 AM

Title: Combining Dependent Annotations for Relational Algebra

ICDT 2012

Author(s): Egor V. Kostylev, Peter Buneman

Available at:

Discussion Leader: Basem Elazzabi

Friday, 04 December 2015 @ 10:00 AM

Title: Reducing Implicit Racial Preferences: I. A Comparative Investigation of 17 Interventions

Journal of Experimental Psychology


Available at:

Discussion Leader: Lois Delcambre

Friday, 20 November 2015 @ 10:00 AM

Title: Supervised Meta-blocking


Author(s): George Papadakis, George Papastefanatos, Georgia Koutrika

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 13 November 2015 @ 10:00 AM

Title: Serving DBpedia with DOLCE – More than Just Adding a Cherry on Top


Author(s): Heiko Paulheim and Aldo Gangemi

Available at:

Discussion Leader: Scott Britell

Friday, 06 November 2015 @ 10:00 AM

Title: RPE Talk

Author(s): Hisham Benotman

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Hisham Benotman

Friday, 30 October 2015 @ 10:00 AM

Title: Musketeer: all for one, one for all in data processing systems

EuroSys '15

Author(s): Gog, Schwarzkopf, Crook, Grosvenor, Clement, Hand

Available at:

Discussion Leader: David Maier

Friday, 16 October 2015 @ 10:00 AM

Title: MillWheel: Fault-Tolerant Stream Processing at Internet Scale

VLDB 2015

Author(s): Tyler Akidau, Alex Balikov, Kaya Bekiroglu, Slava Chernyak, Josh Haberman, Reuven Lax, Sam McVeety, Daniel Mills, Paul Nordstrom, Sam Whittle

Available at:

Discussion Leader: Christopher Giossi

Friday, 09 October 2015 @ 10:00 AM

Title: Data-centric iteration in dynamic workflows

Elsevier - Future Generation Computer Systems The International Journal of eScience

Author(s): Jonas Dias, Gabriel Guerra, Fernando Rochinha, Alvaro L.G.A. Coutinho, Patrick Valduriez, Marta Mattoso

Available at:

Discussion Leader: Basem Elazzabi

Thursday, 10 September 2015 @ 10:00 AM

Title: Data-centric iteration in dynamic workflows

Elsevier - Future Generation Computer Systems The International Journal of eScience

Author(s): Jonas Dias, Gabriel Guerra, Fernando Rochinha, Alvaro L.G.A. Coutinho, Patrick Valduriez, Marta Mattoso

Available at:

Discussion Leader: Basem Elazzabi

Friday, 05 June 2015 @ 10:00 AM

Title: Sample-Driven Schema Mapping

SIGMOD '12, May 20-24, 2012, Scottsdale, Arizona, USA

Author(s): Li Qian, Michael J. Cafarella, H. V. Jagadish

Available at:

Discussion Leader: Basem Elazzabi

Friday, 29 May 2015 @ 10:00 AM

Title: Content Knowledge for Teaching: What Makes It Special?

Journal of Teacher Education Volume 59 Number 5 November/December 2008

Author(s): Deborah Loewenberg Ball, Mark Hoover Thames, and Geoffrey Phelps

Available at:

Discussion Leader: Lois Delcambre

Friday, 22 May 2015 @ 10:00 AM

Title: Hongkong International Terminals Gains Elastic Capacity Using a Data-Intensive Decision-Support System

Interfaces 35(1), pp. 61-75, © 2005 INFORMS

Author(s): K. G. Murty, et al.

Available at:

Discussion Leader: David Maier

Friday, 15 May 2015 @ 10:00 AM

Title: Schema-free SQL

SIGMOD '14, June 22-27, 2014, Snowbird, UT, USA

Author(s): Fei Li, Tianyin Pan, H. V. Jagadish

Available at:

Discussion Leader: Basem Elazzabi

Friday, 08 May 2015 @ 10:00 AM

Title: Multiple Diagram Navigation MDN (RPE Talk)

Author(s): Hisham Benotman

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Hisham Benotman

Friday, 01 May 2015 @ 10:00 AM

Title: A semantic approach to data translation: A case study of environmental observations data

Knowledge-Based Systems 75 (2015) 104-123

Author(s): Yanfeng Shu, David Ratcliffe, Michael Compton, Geoffrey Squire, Kerry Taylor

Available at: Attached

Discussion Leader: Veronika Megler

Friday, 24 April 2015 @ 10:00 AM

Title: Making State Explicit for Imperative Big Data Processing

Visualization and Computer Graphics, IEEE Transactions on (Volume:20 , Issue: 12 )

Author(s): Raul Castro Fernandez, Matteo Migliavacca, Evangelia Kalyvianaki, Peter Pietzuch

Available at:

Discussion Leader: David Maier

Friday, 17 April 2015 @ 10:00 AM

Title: TBD


Author(s): TBD

Available at: TBD

Discussion Leader: TBD

Friday, 10 April 2015 @ 10:00 AM

Title: Predictive Interaction for Data Transformation

CIDR 2015

Author(s): Jeffrey Heer, Joseph Hellerstein, and Sean Kandel

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 13 March 2015 @ 10:00 AM


CIDR 2015

Author(s): Theodore Johnson (AT&T Labs – Research); Vladislav Shkapenyuk (AT&T Labs – Research)

Available at:

Discussion Leader: David Maier

Friday, 06 March 2015 @ 10:00 AM

Title: Assigning Search Tasks Designed to Elicit Exploratory Search Behaviors

Proceedings of the Symposium on Human-Computer Interaction and Information Retrieval. ACM, 2012

Author(s): Barbara M. Wildemuth & Luanne Freund

Available at:

Discussion Leader: Hisham Benotman

Friday, 27 February 2015 @ 12:00 AM

Title: CROWDMAP: Crowdsourcing Ontology Alignment with Microtasks

ISWC 2012

Author(s): Cristina Sarasua, Elena Simperl, and Natalya F. Noy

Available at:

Discussion Leader: Lois Delcambre

Friday, 20 February 2015 @ 10:00 AM


CIDR 2015

Author(s): Anant Bhardwaj (MIT); Souvik Bhattacherjee (U. Maryland); Amit Chavan (U. Maryland); Amol Deshpande (U. Maryland); Aaron J. Elmore (MIT & U. Chicago); Samuel Madden (MIT); Aditya Parameswaran (MIT & U. Illinois)

Available at:

Discussion Leader: Basem Elazzabi

Friday, 13 February 2015 @ 10:00 AM

Title: Analyzing

ISWC 2014

Author(s): Peter F. Patel-Schneider

Available at:

Discussion Leader: Lois and Scott

Friday, 13 February 2015 @ 10:00 AM

Title: Deployment of RDFa, Microdata, and Microformats on the Web – A Quantitative Analysis

ISWC 2013

Author(s): Christian Bizer, Kai Eckert, Robert Meusel, Hannes Mühleisen, Michael Schuhmacher, and Johanna Völker

Available at:

Discussion Leader: Lois and Scott

Friday, 06 February 2015 @ 10:00 AM


CIDR 2015

Author(s): Andrew Crotty (Brown University); Alex Galakatos (Brown University); Kayhan Dursun (Brown University); Tim Kraska (Brown University); Ugur Cetintemel (Brown University); Stan Zdonik (Brown University)

Available at:

Discussion Leader: David Maier

Friday, 30 January 2015 @ 10:00 AM

Title: Biperpedia: An Ontology for Search Applications

VLDB 2014

Author(s): Rahul Gupta, Alon Halevy, Xuezhi Wang, Steven Euijong Whang, Fei Wu

Available at:

Discussion Leader: Veronika Megler

Friday, 23 January 2015 @ 10:00 AM

Title: Explass: Exploring Associations between Entities via Top-K Ontological Patterns and Facets

ISWC 2014

Author(s): Gong Cheng, Yanan Zhang and Yuzhong Qu

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Scott Britell

Friday, 16 January 2015 @ 10:00 AM

Title: Privacy-preserving record linkage using Bloom filters

BMC Medical Informatics and Decision Making 2009, 9:41

Author(s): Rainer Schnell, Tobias Bachteler and Jörg Reiher

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 05 December 2014 @ 10:00 AM

Title: Detection, Simulation and Elimination of Semantic Anti-patterns in Ontology-Driven Conceptual Models

ER 2014

Author(s): Giancarlo Guizzardi, Tiago Prince Sales

Available at:

Discussion Leader: Scott Britell

Friday, 21 November 2014 @ 10:00 AM

Title: Towards Integrating the Detection of Genetic Variants into an In-Memory Database

2014 IEEE International Conference on Big Data

Author(s): Cindy Fähnrich, Matthieu-P. Schapranow, Hasso Plattner

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Patrick Leyshock

Friday, 14 November 2014 @ 10:00 AM

Title: Uniform access to NoSQL systems

Inf. Syst. (IS) 43:117-133 (2014)

Author(s): Paolo Atzeni, Francesca Bugiotti, Luca Rossi

Available at:

Discussion Leader: Paolo Atzeni

Friday, 07 November 2014 @ 10:00 AM

Title: A runtime approach to model-generic translation of schema and data

Information Systems Volume 37 Issue 3, May, 2012

Author(s): Paolo Atzeni, Luigi Bellomarini, Francesca Bugiotti, Fabrizio Celli, and Giorgio Gianforme.

Available at:

Discussion Leader: Lois Delcambre

Friday, 31 October 2014 @ 10:00 AM

Title: Kinds of contexts and their impact on semantic similarity measurement

Pervasive Computing and Communications 2008

Author(s): Krzysztof Janowicz

Available at:

Discussion Leader: Veronika Megler

Friday, 24 October 2014 @ 10:00 AM

Title: Exploring the Design Space of Composite Visualization

Pacific Visualization Symposium (PacificVis), 2012 IEEE

Author(s): Waqas Javed & Niklas Elmqvist

Available at:

Discussion Leader: Hisham Benotman

Friday, 10 October 2014 @ 10:00 AM

Title: The Trill Incremental Analytics Engine


Author(s): Badrish Chandramouli, Jonathan Goldstein, Mike Barnett, Robert DeLine, Danyel Fisher, John C. Platt, James F. Terwilliger, and John Wernsing

Available at:

Discussion Leader: David Maier

Friday, 06 June 2014 @ 10:00 AM

Title: Table extraction using conditional random fields

SIGIR '03 Proceedings

Author(s): David Pinto, Andrew McCallum, Xing Wei and W. Bruce Croft

Available at:

Discussion Leader: Dona Hertel

Friday, 30 May 2014 @ 10:00 AM

Title: Shark: SQL and Rich Analytics at Scale

Proceedings of the 2013 international conference on Management of data.

Author(s): R. Xin, J. Rosen, M. Zaharia, M. Franklin, S. Shenker, I. Stoica

Available at:

Discussion Leader: Patrick Leyshock

Friday, 23 May 2014 @ 10:00 AM

Title: The Conceptual Model ≡ An Adequate and Dependable Artifact Enhanced by Concepts

Info Modeling & Knowledge Bases, IOS Press, 2014

Author(s): Bernhard Thalheim

Available at: TBA

Discussion Leader: Lois Delcambre

Friday, 09 May 2014 @ 10:00 AM

Title: Automatic Web Spreadsheet Data Extraction

VLDB Workshop on Semantic Search over the Web, Trento, Italy. 2013

Author(s): Zhe Chen, Michael Cafarella

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 02 May 2014 @ 10:00 AM

Title: The Bohemian Bookshelf: Supporting Serendipitous Book Discoveries through Information Visualization

CHI '12 Proceedings of the SIGCHI Conference on Human Factors in Computing Systems 2012

Author(s): Alice Thudt, Uta Hinrichs and Sheelagh Carpendale

Available at:

Discussion Leader: Hisham Benotman

Friday, 25 April 2014 @ 10:00 AM

Title: Schema exchange: Generic mappings for transforming data and metadata

Data & Knowledge Engineering Volume 68 Issue 7, July, 2009

Author(s): Paolo Papotti and Riccardo Torlone

Available at:

Discussion Leader: Scott Britell

Friday, 18 April 2014 @ 10:00 AM

Title: Towards the web of concepts: extracting concepts from large datasets

VLDB 2010

Author(s): Aditya Parameswaran, Hector Garcia-Molina, Anand Rajaraman

Available at:

Discussion Leader: Veronika Megler

Friday, 11 April 2014 @ 10:00 AM

Title: Data Curation at Scale: The Data Tamer System

CIDR 2013

Author(s): Michael Stonebraker (MIT); Daniel Bruckner (UC Berkeley); Ihab Ilyas (QCRI); George Beskales (QCRI); Mitch Cherniack (Brandeis University); Stan Zdonik (Brown University); Alexander Pagan (MIT); Shan Xu (Verisk Analytics)

Available at:

Discussion Leader: David Maier

Friday, 14 March 2014 @ 10:00 AM

Title: Scalable Anomaly Detection for Smart City Infrastructure Networks

IEEE Internet Computing, Volume 17, Issue 6

Author(s): Difallah, Djellel Eddine; Cudre-Mauroux, Philippe; McKenna, Sean A.

Available at:

Discussion Leader: Veronika Megler

Friday, 07 March 2014 @ 10:00 AM

Title: Do Graphical Search Interfaces Support Effective Search for and Evaluation of Digital Library Resources?


Author(s): Kirsten R. Butcher, Sarah Davies, Ashley Crockett, Aaron Dewald, Robert Zheng

Available at:

Discussion Leader: Hisham Benotman

Friday, 28 February 2014 @ 10:00 AM

Title: Profiling, What-if Analysis, and Cost-Based Optimization of MapReduce Programs

VLDB 2011

Author(s): H. Herodotou & S. Babu

Available at:

Discussion Leader: Patrick Leyshock

Friday, 21 February 2014 @ 10:00 AM

Title: Scientific Data Management in the Coming Decade

Microsoft Research Tech. Report 2005

Author(s): Jim Gray, David T. Liu, Maria Nieto-Santisteban, Alexander S. Szalay, David DeWitt and Gerd Heber

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 14 February 2014 @ 10:00 AM

Title: Visual Cluster Exploration of Web Clickstream Data

IEEE Symposium on Visual Analytics Science and Technology 2012

Author(s): Jishang Wei et al.

Available at:

Discussion Leader: Dona Hertel

Friday, 07 February 2014 @ 10:00 AM

Title: Rank and Relevance in Novelty and Diversity Metrics


Author(s): Saúl Vargas and Pablo Castells

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 31 January 2014 @ 10:00 AM

Title: NoizCrowd: A Crowd-Based Data Gathering and Management System for Noise Level Data

Mobile Web Information Systems Lecture Notes in Computer Science Volume 8093, 2013, pp 172-186

Author(s): Mariusz Wisniewski, Gianluca Demartini, Apostolos Malatras, Philippe Cudré-Mauroux

Available at:

Discussion Leader: David Maier

Friday, 24 January 2014 @ 10:00 AM

Title: Extending relational query optimization to dynamic schemas for information integration in multidatabases


Author(s): Catharine M. Wyss and Felix I. Wyss.

Available at:

Discussion Leader: Scott Britell

Friday, 17 January 2014 @ 10:30 AM

Title: Reexamining the Cluster Hypothesis: Scatter/Gather on Retrieval Results

Proceedings of the Nineteenth Annual International ACM SIGIR Conference, Zurich, June 1996.

Author(s): Marti A. Hearst and Jan O. Pedersen

Available at: also in ACM digital library

Discussion Leader: Lois Delcambre

Friday, 17 January 2014 @ 10:00 AM

Title: The cluster hypothesis revisited

SIGIR 1985

Author(s): Voorhees E.

Available at:

Discussion Leader: Lois Delcambre

Friday, 10 January 2014 @ 10:00 AM

Title: Paper Choosing Session


Available at: #

Discussion Leader:

Friday, 06 December 2013 @ 10:00 AM

Title: Rank and relevance in novelty and diversity metrics for recommender systems

Author(s): Saúl Vargas and Pablo Castells

Available at:

Discussion Leader: Dona Hertel

Friday, 22 November 2013 @ 10:00 AM

Title: GenBase: A Benchmark for the Genomics Era

Author(s): Pradeep Dubey, Nadathur Satish, Narayanan Sundaram, Sam Madden, Mike Stonebraker, Rebecca Taft and Manasi Vartak

Available at: N/A

Discussion Leader: Patrick Leyshock

Friday, 15 November 2013 @ 10:00 AM

Title: DBpedia - A Crystallization Point for the Web of Data

Author(s): Christian Bizer, Jens Lehmann, Georgi Kobilarov, Sören Auer, Christian Becker, Richard Cyganiak, Sebastian Hellmann

Available at:

Discussion Leader: Hisham Benotman

Friday, 08 November 2013 @ 10:00 AM

Title: Tracking Trash

Author(s): Santi Phithakkitnukoon, Malima I. Wolf, Dietmar Offenhuber, David Lee, Assaf Biderman, Carlo Ratti

Available at:

Discussion Leader: David Maier

Friday, 01 November 2013 @ 10:00 AM

Title: Tuning Large Scale Deduplication with Reduced Effort

Author(s): Guilherme Dal Bianco, Renata Galante, Carlos A. Heuser, and Marcos André Gonçalves

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 25 October 2013 @ 10:00 AM

Title: Item popularity and recommendation accuracy

Author(s): Harald Steck

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 18 October 2013 @ 10:00 AM

Title: From Personal Desktops to Personal Dataspaces: A Report on Building the iMeMex Personal Dataspace Management System

Author(s): Jens-Peter Dittrich, Lukas Blunschi, Markus Färber, Olivier René Girard, Shant Kirakos Karakashian, Marcos Antonio Vaz Salles

Available at:

Discussion Leader: Veronika Megler

Friday, 11 October 2013 @ 10:00 AM

Title: Semi-Automatically Mapping Structured Sources into the Semantic Web

Author(s): Craig A. Knoblock, Pedro Szekely, José Luis Ambite, Aman Goel, Shubham Gupta, Kristina Lerman, Maria Muslea, Mohsen Taheriyan, Parag Mallick

Available at:

Discussion Leader: Scott Britell

Friday, 28 June 2013 @ 10:00 AM

Title: Paper-choosing Session


Available at: #

Discussion Leader:

Friday, 07 June 2013 @ 10:00 AM

Title: Data cleaning: Problems and current approaches

IEEE Data Engineering Bulletin 2000

Author(s): Erhard Rahm and Hong Hai Do

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 31 May 2013 @ 10:00 AM

Title: Stability of Recommendation Algorithms

TOIS 12 vol 4

Author(s): Adomavicius and Zhang

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 10 May 2013 @ 10:00 AM

Title: Schema mediation in peer data management systems

ICDE 2003

Author(s): Halevy, A.Y.; Ives, Z.G.; Suciu, D.; Tatarinov, I.

Available at:

Discussion Leader: Veronika Megler

Friday, 03 May 2013 @ 10:00 AM

Title: A survey of query-by-humming similarity methods

PETRA 2012

Author(s): Kotsifakos, Alexios et al.

Available at:

Discussion Leader: Dona Hertel

Friday, 26 April 2013 @ 10:00 AM

Title: Identifying Relationships between Spreadsheets


Author(s): Abdussalam Alawini

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 19 April 2013 @ 10:00 AM

Title: SociaLite: Datalog Extensions for Efficient Social Network Analysis

ICDE 2013

Author(s): Jiwon Seo (Stanford), Stephen Guo (Stanford), Monica Lam (Stanford)

Available at:

Discussion Leader: Scott Britell

Friday, 12 April 2013 @ 10:00 AM

Title: Constructivism in Computer Science Education


Author(s): Mordechai Ben-Ari

Available at:

Discussion Leader: Lois Delcambre

Friday, 12 April 2013 @ 10:00 AM

Title: Why Minimal Guidance During Instruction Does Not Work: An Analysis of the Failure of Constructivist, Discovery, Problem-Based, Experiential, and Inquiry-Based Teaching

Educational Psychologist, 41:2, 75-86, 2006

Author(s): Paul A. Kirschner, John Sweller, and Richard E. Clark

Available at:

Discussion Leader: Lois Delcambre

Friday, 05 April 2013 @ 11:00 AM

Title: Paper-choosing Session


Available at: #

Discussion Leader:

Thursday, 14 March 2013 @ 2:00 PM

Title: The Clio project: managing heterogeneity

SIGMOD Record 2001

Author(s): Renée J. Miller, Mauricio A. Hernández, Laura M. Haas, Lingling Yan, C. T. Howard Ho, Ronald Fagin, and Lucian Popa

Available at:

Discussion Leader: Dona Hertel

Thursday, 28 February 2013 @ 2:00 PM

Title: Incorporating variability in user behavior into systems based evaluation

CIKM 2012

Author(s): Ben Carterette, Evangelos Kanoulas, and Emine Yilmaz

Available at:

Discussion Leader: Jeremy Steinhauer

Thursday, 14 February 2013 @ 2:00 PM

Title: Observation-Driven Geo-Ontology Engineering

Transactions in GIS 2012

Author(s): Krzysztof Janowicz

Available at:

Discussion Leader: Scott Britell

Thursday, 07 February 2013 @ 2:00 PM

Title: Efficient classification across multiple database relations: a CrossMine approach

IEEE Transactions on Knowledge and Data Engineering 2006

Author(s): Yin, X.; Han, J.; Yang, J.; Yu, P.S.

Available at:

Discussion Leader: Abdussalam Alawini

Thursday, 31 January 2013 @ 2:00 PM

Title: Validating Multi-column Schema Matchings by Type

ICDE 2008

Author(s): Bing Tian Dai; Koudas, N.; Srivastava, D.; Tung, A.K.H.; Venkatasubramanian, S.

Available at:

Discussion Leader: Lois Delcambre

Thursday, 24 January 2013 @ 2:00 PM

Title: Matching unstructured product offers to structured product specifications


Author(s): Anitha Kannan, Inmar E. Givoni, Rakesh Agrawal, and Ariel Fuxman

Available at:

Discussion Leader: Veronika Megler

Thursday, 17 January 2013 @ 2:00 PM

Title: Automatic partitioning of database applications

VLDB 2012

Author(s): Alvin Cheung, Samuel Madden, Owen Arden, and Andrew C. Myers

Available at:

Discussion Leader: Patrick Leyshock

Friday, 11 January 2013 @ 10:00 AM

Title: Paper-choosing Session


Available at: #

Discussion Leader:

Friday, 30 November 2012 @ 10:00 AM

Title: Contextualized knowledge repositories for the Semantic Web

We propose Contextualized Knowledge Repository (CKR): an adaptation of the well studied theories of context for the Semantic Web. A CKR is composed of a set of OWL 2 knowledge bases, which are embedded in a context by a set of qualifying attributes (time, space, topic, etc.) specifying the boundaries within which the knowledge base is assumed to be true. Contexts of a CKR are organized by a hierarchical coverage relation, which enables an effective representation of knowledge and a flexible method for its reuse between the contexts. The paper defines the syntax and the semantics of CKR; shows that concept satisfiability and subsumption are decidable with the complexity upper bound of 2NExpTime, and it also provides a sound and complete natural deduction calculus that serves to characterize the propagation of knowledge between contexts.

Author(s): Luciano Serafini, Martin Homola

Available at:

Discussion Leader: Lois Delcambre

Friday, 16 November 2012 @ 10:00 AM

Title: Spanner: Google's Globally-Distributed Database

Spanner is Google's scalable, multi-version, globally-distributed, and synchronously-replicated database. It is the first system to distribute data at global scale and support externally-consistent distributed transactions. This paper describes how Spanner is structured, its feature set, the rationale underlying various design decisions, and a novel time API that exposes clock uncertainty. This API and its implementation are critical to supporting external consistency and a variety of powerful features: non-blocking reads in the past, lock-free read-only transactions, and atomic schema changes, across all of Spanner.

Author(s): James C. Corbett, Jeffrey Dean, Michael Epstein, Andrew Fikes, Christopher Frost, JJ Furman, Sanjay Ghemawat, Andrey Gubarev, Christopher Heiser, Peter Hochschild, Wilson Hsieh, Sebastian Kanthak, Eugene Kogan, Hongyi Li, Alexander Lloyd, Sergey Melnik, D

Available at:

Discussion Leader: Bryon Nevis

Friday, 09 November 2012 @ 10:00 AM

Title: Multidimensional Integrated Ontologies: A Framework for Designing Semantic Data Warehouses

The Semantic Web enables companies and organizations to gather huge amounts of valuable semantically annotated data concerning their subjects of interest. Nowadays, many applications attach metadata and semantic annotations taken from domain and application ontologies to the information they generate. From our point of view, the concepts in these ontologies could describe the facts, dimensions, categories and values implied in the analysis subjects of a data warehouse. In this paper we propose the Semantic Data Warehouse to be a repository of ontologies and semantically annotated data resources. We also propose an ontology-driven framework to design multidimensional analysis models for Semantic Data Warehouses. This framework provides means for building an integrated ontology, called the Multidimensional Integrated Ontology (MIO), including the classes, relationships and instances that represent interesting analysis dimensions and measures. The reasoning capabilities of a MIO can be used to check the properties required by current multidimensional databases (e.g., dimension orthogonality, category satisfiability, etc.). In this paper we also sketch how the instance data of a MIO can be translated into OLAP cubes for analysis purposes. Finally, some implementation issues of the overall framework are discussed. Keywords: Data warehouses, Semantic Web, Multi-ontology integration

Author(s): Victoria Nebot, Rafael Berlanga, Juan Manuel Pérez, María José Aramburu and Torben Bach Pedersen

Available at:

Discussion Leader: Christoph Schuetz

Friday, 02 November 2012 @ 10:00 AM

Title: Time-based calibration of effectiveness measures

Many current effectiveness measures incorporate simplifying assumptions about user behavior. These assumptions prevent the measures from reflecting aspects of the search process that directly impact the quality of retrieval results as experienced by the user. In particular, these measures implicitly model users as working down a list of retrieval results, spending equal time assessing each document. In reality, even a careful user, intending to identify as much relevant material as possible, must spend longer on some documents than on others. Aspects such as document length, duplicates and summaries all influence the time required. In this paper, we introduce a time-biased gain measure, which explicitly accommodates such aspects of the search process. By conducting an appropriate user study, we calibrate and validate the measure against the TREC 2005 Robust Track test collection. We examine properties of the measure, contrasting it to traditional effectiveness measures, and exploring its extension to other aspects and environments. As its primary benefit, the measure allows us to evaluate system performance in human terms, while maintaining the simplicity and repeatability of system-oriented tests. Overall, we aim to achieve a clearer connection between user-oriented studies and system-oriented tests, allowing us to better transfer insights and outcomes from one to the other.

Author(s): Mark Smucker, Charles Clarke

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 26 October 2012 @ 10:00 AM

Title: Human-Powered Sorts and Joins

Crowdsourcing markets like Amazon's Mechanical Turk (MTurk) make it possible to task people with small jobs, such as labeling images or looking up phone numbers, via a programmatic interface. MTurk tasks for processing datasets with humans are currently designed with significant reimplementation of common workflows and ad-hoc selection of parameters such as price to pay per task. We describe how we have integrated crowds into a declarative workflow engine called Qurk to reduce the burden on workflow designers. In this paper, we focus on how to use humans to compare items for sorting and joining data, two of the most common operations in DBMSs. We describe our basic query interface and the user interface of the tasks we post to MTurk. We also propose a number of optimizations, including task batching, replacing pairwise comparisons with numerical ratings, and pre-filtering tables before joining them, which dramatically reduce the overall cost of running sorts and joins on the crowd. In an experiment joining two sets of images, we reduce the overall cost from $67 in a naive implementation to about $3, without substantially affecting accuracy or latency. In an end-to-end experiment, we reduced cost by a factor of 14.5.

Author(s): Adam Marcus, Eugene Wu, David Karger, Samuel Madden, Robert Miller

Available at:

Discussion Leader: Scott Britell

Friday, 19 October 2012 @ 10:00 AM

Title: SheetDiff: A Tool for Identifying Changes in Spreadsheets

2010 IEEE Symposium on Visual Languages and Human-Centric Computing

Author(s): Chambers, C., Erwig, M., & Luckey, M.

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 28 September 2012 @ 10:00 AM

Title: Paper-choosing Session


Available at: #

Discussion Leader:

Friday, 31 August 2012 @ 10:00 AM

Title: Data Cube: A Relational Aggregation Operator Generalizing Group-By, Cross-Tab, and Sub-Totals

Data Mining and Knowledge Discovery 1997

Author(s): Jim Gray, Surajit Chaudhuri, Adam Bosworth, Andrew Layman, Don Reichart, Murali Venkatrao, Frank Pellow, and Hamid Pirahesh

Available at:

Discussion Leader: Veronika Megler

Friday, 24 August 2012 @ 10:00 AM

Title: OPTICS: ordering points to identify the clustering structure


Author(s): Mihael Ankerst, Markus M. Breunig, Hans-Peter Kriegel, and Jörg Sander

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 17 August 2012 @ 10:00 AM

Title: Change patterns and change support features – Enhancing flexibility in process-aware information systems

DKE 2008

Author(s): Barbara Weber, Manfred Reichert, Stefanie Rinderle-Ma

Available at:

Discussion Leader: Christoph Schuetz

Friday, 10 August 2012 @ 10:00 AM

Title: A Semantic Approach to Discovering Schema Mapping Expressions

ICDE 2007

Author(s): An, Y., Borgida, A., Miller, R.J. and Mylopoulos, J.

Available at:

Discussion Leader: Lois Delcambre

Friday, 03 August 2012 @ 10:00 AM

Title: NoDB: efficient query execution on raw data files


Author(s): Ioannis Alagiannis, Renata Borovica, Miguel Branco, Stratos Idreos, and Anastasia Ailamaki

Available at:

Discussion Leader: Scott Britell

Friday, 20 July 2012 @ 10:00 AM

Title: The design of the multitenant internet application development platform


Author(s): Craig D. Weissman and Steve Bobrowski

Available at:

Discussion Leader: Bryon Nevis

Friday, 13 July 2012 @ 10:00 AM

Title: Fuzzy querying of incomplete, imprecise, and heterogeneously structured data in the relational model using ontologies and rules

IEEE Transactions on Fuzzy Systems, Volume 13, Issue 3

Author(s): Buche, P., Dervin, C., Haemmerle, O., Thomopoulos, R.

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 06 July 2012 @ 10:00 AM

Title: Temporal Analytics on Big Data for Web Advertising

ICDE 2012

Author(s): Badrish Chandramouli, Jonathan Goldstein, and Songyun Duan

Available at:

Discussion Leader: David Maier

Friday, 29 June 2012 @ 10:00 AM

Title: Paper-choosing Session


Available at: #

Discussion Leader:

Friday, 08 June 2012 @ 10:00 AM

Title: Techniques for Efficiently Querying Scientific Workflow Provenance Graphs

EDBT 2010

Author(s): Manish Kumar Anand, Shawn Bowers, Bertram Ludäscher

Available at:

Discussion Leader: Lois Delcambre

Friday, 01 June 2012 @ 10:00 AM

Title: Searching with Numbers

WWW 2002

Author(s): Rakesh Agrawal and Ramakrishnan Srikant

Available at:

Discussion Leader: Veronika Megler

Friday, 25 May 2012 @ 10:00 AM

Title: Storing Matrices on Disk: Theory and Practice Revisited

VLDB 2011

Author(s): Yi Zhang, Kamesh Munagala, and Jun Yang

Available at:

Discussion Leader: Patrick Leyshock

Friday, 18 May 2012 @ 10:00 AM

Title: Uniform Access to Non-Relational Database Systems: the SOS Platform

CAiSE 2012

Author(s): Atzeni, Bugiotti, Rossi

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Michael Grossniklaus

Friday, 11 May 2012 @ 10:00 AM

Title: No reading group this week - RPE presentations in FAB 86-01


Available at: #

Discussion Leader:

Friday, 04 May 2012 @ 10:00 AM

Title: OLAP query reformulation in peer-to-peer data warehousing

Information Systems, Volume 37, Issue 5, July 2012

Author(s): M. Golfarelli, F. Mandreoli, W. Penzo, S. Rizzi, E. Turricchia

Available at:

Discussion Leader: Christoph Schuetz

Friday, 27 April 2012 @ 10:00 AM

Title: Graph Pattern Matching: A Join/Semijoin Approach

TKDE Volume: 23 Issue:7 2011

Author(s): Jiefeng Cheng, Jeffrey Xu Yu, and Philip S. Yu

Available at:

Discussion Leader: Dave Maier

Friday, 20 April 2012 @ 10:00 AM

Title: Recovering Semantics of Tables on the Web

VLDB 2011

Author(s): Petros Venetis, Alon Halevy, Jayant Madhavan, Marius Paşca, Warren Shen, Fei Wu, Gengxin Miao, and Chung Wu

Available at:

Discussion Leader: Alon Halevy

Friday, 13 April 2012 @ 10:00 AM

Title: No reading group this week - Faculty Candidate Talk in FAB 86-01


Available at:

Discussion Leader:

Friday, 06 April 2012 @ 10:00 AM

Title: Paper Choosing Session - Talk by Christoph Schuetz


Available at: #

Discussion Leader:

Friday, 23 March 2012 @ 10:00 AM

Title: An empirical characterization of stream programs and its implications for language and compiler design

PACT 2010

Author(s): William Thies and Saman Amarasinghe

Available at:

Discussion Leader: Kristin Tufte

Friday, 16 March 2012 @ 10:00 AM

Title: From Spreadsheets to Relational Databases and Back

2009 ACM SIGPLAN workshop on Partial evaluation and program manipulation

Author(s): Jacome Cunha, Joao Saraiva, and Joost Visser

Available at:

Discussion Leader: Abdussalam Alawini

Friday, 09 March 2012 @ 10:00 AM

Title: Model-driven Development of Context-Aware Web Applications

TOIT Volume 7, Number 1, February 2007

Author(s): Stefano Ceri, Florian Daniel, Maristella Matera, Federico M. Facca

Available at:

Discussion Leader: Scott Britell

Friday, 02 March 2012 @ 10:00 AM

Title: Understanding Queries in a Search Database System

PODS 2010

Author(s): Ronald Fagin, Benny Kimelfeld, Yunyao Li, Sriram Raghavan

Available at:

Discussion Leader: Veronika Megler

Friday, 24 February 2012 @ 10:00 AM

Title: Fast and accurate estimation of shortest paths in large graphs

CIKM 2010

Author(s): Andrey Gubichev, Srikanta Bedathur, Stephan Seufert, and Gerhard Weikum

Available at:

Discussion Leader: Dave Maier

Friday, 17 February 2012 @ 10:00 AM

Title: Understanding digital library adoption: a use diffusion approach

JCDL 2011

Author(s): Keith E. Maull, Manuel Gerardo Saldivar, and Tamara Sumner

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 10 February 2012 @ 10:00 AM

Title: Scalable SPARQL Querying of Large RDF Graphs

VLDB 2011

Author(s): Jiewen Huang, Daniel J. Abadi, and Kun Ren

Available at:

Discussion Leader: Michael Grossniklaus

Friday, 03 February 2012 @ 10:00 AM

Title: Databases will Visualize Queries too

VLDB 2011

Author(s): W. Gatterbauer

Available at:

Discussion Leader: Len Shapiro

Friday, 03 February 2012 @ 10:00 AM

Title: Using Data for Systemic Financial Risk Management

CIDR 2011

Author(s): Mark Flood, HV Jagadish, Albert Kyle, Frank Olken, and Louiqa Raschid

Available at:

Discussion Leader: Len Shapiro

Friday, 03 February 2012 @ 10:00 AM

Title: Computational Journalism: A Call to Arms to Database Researchers

CIDR 2011

Author(s): Sarah Cohen, Chengkai Li, Jun Yang, and Cong Yu

Available at:

Discussion Leader: Len Shapiro

Friday, 27 January 2012 @ 10:00 AM

Title: Ricardo: integrating R and Hadoop


Author(s): Sudipto Das, Yannis Sismanis, Kevin S. Beyer, Rainer Gemulla, Peter J. Haas, and John McPherson

Available at:

Discussion Leader: Patrick Leyshock

Friday, 20 January 2012 @ 10:00 AM

Title: Towards a pattern science for the Semantic Web

Semantic Web Volume 1, Number 1-2 / 2010

Author(s): Aldo Gangemi and Valentina Presutti

Available at:

Discussion Leader: Lois Delcambre

Friday, 09 December 2011 @ 10:00 AM

Title: A Parameterized Representation of Uncertain Conceptual Spaces

Transactions in GIS, 2004

Author(s): Ola Ahlqvist

Available at:

Discussion Leader: Veronika Megler

Friday, 02 December 2011 @ 10:00 AM

Title: Discovering HERM

Author(s): Bernhard Thalheim

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre

Friday, 18 November 2011 @ 10:00 AM

Title: Variable Length Compression for Bitmap Indices

DEXA 2011

Author(s): Fabian Corrales, David Chiu and Jason Sawin

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: David Chiu

Friday, 04 November 2011 @ 10:00 AM

Title: Synthesizing Products for Online Catalogs

VLDB 2011

Author(s): Hoa Nguyen, Ariel Fuxman, Stelios Paparizos, Juliana Freire, Rakesh Agrawal

Available at:

Discussion Leader: Kristin Tufte

Friday, 28 October 2011 @ 10:00 AM

Title: Find it if you can: a game for modeling different types of web search success using interaction data

SIGIR 2011

Author(s): Mikhail Ageev, Qi Guo, Dmitry Lagun, and Eugene Agichtein

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 21 October 2011 @ 10:00 AM

Title: Entity-relationship queries over wikipedia

SMUC 2010

Author(s): Xiaonan Li, Chengkai Li, and Cong Yu

Available at:

Discussion Leader: Scott Britell

Friday, 14 October 2011 @ 10:00 AM

Title: Pregel: a system for large-scale graph processing


Author(s): Grzegorz Malewicz, Matthew H. Austern, Aart J.C Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski

Available at:

Discussion Leader: Dave Maier

Friday, 07 October 2011 @ 10:00 AM

Title: Bridging Two Worlds with RICE

VLDB 2011

Author(s): Philipp Grosse, Wolfgang Lehner, Thomas Weichert, Franz Farber, Wen-Syan

Available at:

Discussion Leader: Patrick Leyshock

Friday, 30 September 2011 @ 10:00 AM

Title: Paper-choosing session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader:

Friday, 02 September 2011 @ 10:00 AM

Title: Model-independent schema and data translation

A runtime approach to model-generic translation of schema and data is proposed. It is based on our previous work on MIDST, a platform conceived to perform translations in an off-line fashion. In the original approach, the source database is imported into a dictionary, where it is stored according to a universal model. Then, the translation is applied within the tool as a composition of elementary transformation steps, specified as Datalog programs. Finally, the result is exported into the operational system. Here we illustrate a new, lightweight approach where the database is not imported. The tool needs only to know the model and the schema of the source database and generates views on the operational system that transform the underlying data (stored in the source schema) according to the corresponding schema in the target model. Views are generated in an almost automatic way, on the basis of the Datalog rules for schema translation. Current work on extensions of the approach to the family of the so called noSQL systems will also be sketched.


Available at:

Discussion Leader: Paolo Atzeni

Friday, 26 August 2011 @ 10:00 AM

Title: Learning in Query Optimization

Database Systems let users specify queries in a declarative language like SQL. Most modern DBMS optimizers rely upon a cost model to choose the best query execution plan (QEP) for any given query. Cost estimates are heavily dependent upon the optimizer's estimates for the number of rows that will result at each step of the QEP for complex queries involving many predicates and/or operations. These estimates, in turn, rely upon statistics on the database and modeling assumptions that may or may not be true for a given database. In my talk, I will present an overview of the research on learning in query optimization. I will introduce the concept of a LEarning Optimizer (LEO) as a comprehensive way to repair incorrect statistics and cardinality estimates of a query execution plan. By monitoring executed queries, LEO compares the optimizer's estimates with actuals at each step in a QEP, and computes adjustments to cost estimates and statistics that may be used during the current and future query optimizations. LEO introduces a feedback loop to query optimization that enhances the available information on the database where the most queries have occurred, allowing the optimizer to actually learn from its past mistakes. In the second part of the talk, I describe how the knowledge gleaned by LEO is exploited consistently in a query optimizer, by adjusting the optimizer's model and by maximizing information entropy. In the third part of the talk, I will briefly sketch my current research work and vision on Information Management in the Cloud in the Stratosphere (massively parallel and distributed processing) and MIA (information marketplace) projects.


Available at:

Discussion Leader: Volker Markl

Friday, 19 August 2011 @ 10:00 AM

Title: Semantic Stream Query Optimization Exploiting Dynamic Metadata

ICDE 2011

Author(s): Luping Ding, Karen Works, Elke A. Rundensteiner

Available at:

Discussion Leader: David Maier

Friday, 12 August 2011 @ 10:00 AM

Title: Automatic schema merging using mapping constraints among incomplete sources

CIKM 2010

Author(s): Xiang Li, Christoph Quix, David Kensche, and Sandra Geisler

Available at:

Discussion Leader: Scott Britell

Friday, 05 August 2011 @ 10:00 AM

Title: Design and Implementation of Verifiable Audit Trails for a Versioning File System


Author(s): Zachary N. J. Peterson, Randal Burns, Giuseppe Ateniese, and Stephen Bono

Available at:

Discussion Leader: David Archer

Friday, 29 July 2011 @ 10:00 AM

Title: MonetDB/SQL Meets SkyServer: The Challenges of a Scientific Database

SSDBM 2007

Author(s): M. Ivanova, N. Nes, R. Goncalves, and M. Kersten

Available at:

Discussion Leader: Patrick Leyshock

Friday, 15 July 2011 @ 10:00 AM

Title: Improving Recommender Systems by Incorporating Social Contextual Information

ACM TOIS Vol. 29, No. 2, 2011

Author(s): Hao Ma, Tom Chao Zhou, Michael R. Lyu, Irwin King

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 08 July 2011 @ 10:00 AM

Title: A Generic Database Schema for CIDOC-CRM Data Management

ADBIS 2011

Author(s): Kai Jannaschk, Claas Anders Rathje, Bernhard Thalheim, and Frank Förster

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre

Friday, 01 July 2011 @ 10:00 AM

Title: A method to map heterogeneity between near but non-equivalent semantic attributes in multiple health data registries

Health Informatics Journal Vol. 14 No. 1 2008

Author(s): Nadine Schuurman and Agnieszka Leszczynski

Available at:

Discussion Leader: Veronika Megler

Friday, 24 June 2011 @ 10:00 AM

Title: Paper-choosing session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone

Friday, 17 June 2011 @ 10:00 AM

Title: A Position Paper on Data Sovereignty: The Importance of Geolocating Data in the Cloud

USENIX HotCloud '11

Author(s): Zachary N.J. Peterson, Mark Gondree, and Robert Beverly

Available at:

Discussion Leader: Zachary Peterson

Friday, 03 June 2011 @ 10:00 AM

Title: Hybrid Merge/Overlap Execution Technique for Parallel Array Processing

EDBT/ICDT Array Databases Workshop 2011

Author(s): Emad Soroush and Magdalena Balazinska

Available at:

Discussion Leader: Patrick Leyshock

Friday, 20 May 2011 @ 10:00 AM

Title: Social media recommendation based on people and tags

SIGIR 2010

Author(s): I. Guy, N. Zwerdling, I. Ronen, D. Carmel, and E. Uziel

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 06 May 2011 @ 10:00 AM

Title: C-MR: A Continuous-MapReduce Processing Model for Low-Latency Stream Processing on Multi-Core Architectures

Technical Report CS-10-01, Brown University, Feb. 2010

Author(s): N. Backman, K. Pattabiraman, U. Cetintemel

Available at:

Discussion Leader: Michael Grossniklaus

Friday, 29 April 2011 @ 10:00 AM

Title: Towards Practical Incremental Recomputation for Scientists: An Implementation for the Python Language

TaPP 2010

Author(s): Philip J. Guo and Dawson Engler

Available at:

Discussion Leader: David Archer

Friday, 22 April 2011 @ 10:00 AM

Title: How Soccer Players Would Do Stream Joins


Author(s): Jens Teubner and Rene Mueller

Available at:

Discussion Leader: Kristin Tufte

Friday, 15 April 2011 @ 10:00 AM

Title: A Generalized Join Algorithm

BTW 2011

Author(s): Goetz Graefe

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Len Shapiro

Friday, 08 April 2011 @ 10:00 AM

Title: A co-Relational Model of Data for Large Shared Data Banks


Author(s): Erik Meijer and Gavin Bierman

Available at:

Discussion Leader: David Maier

Friday, 18 March 2011 @ 10:00 AM

Title: Evolution and Future Directions of Large Scale Storage and Computation Systems at Google

SoCC 2010

Author(s): Jeffrey Dean

Available at:

Discussion Leader: Len Shapiro

Friday, 11 March 2011 @ 10:00 AM

Title: Indexing Multi-dimensional Data in a Cloud System


Author(s): Jinbao Wang, Sai Wu, Hong Gao, Jianzhong Li, and Beng Chin Ooi

Available at:

Discussion Leader: Travis Hall

Friday, 04 March 2011 @ 10:00 AM

Title: Scalable Data Integration by Mapping Data to Queries

Technical Report 633, Dept. of Computer Science, ETH Zurich, July 2009

Author(s): Martin Hentschel, Donald Kossmann, Daniela Florescu, Laura Haas, Tim Kraska, and Renee J. Miller

Available at:

Discussion Leader: Scott Britell

Friday, 25 February 2011 @ 10:00 AM

Title: Incorporating Partitioning and Parallel Plans into the SCOPE Optimizer

ICDE 2010

Author(s): Jingren Zhou, Per-Ake Larson, and Ronnie Chaiken

Available at:

Discussion Leader: Michael Grossniklaus

Friday, 18 February 2011 @ 10:00 AM

Title: In Support of Mesodata in Database Management Systems

LNCS vol. 3180, 2004

Author(s): Denise de Vries, Sally Rice, and John F. Roddick

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Veronika Megler

Friday, 11 February 2011 @ 10:00 AM

Title: An Architecture for Recycling Intermediates in a Column-store


Author(s): Milena G. Ivanova, Martin L. Kersten, Niels J. Nes, and Romulo A.P. Goncalves

Available at:

Discussion Leader: Patrick Leyshock

Friday, 04 February 2011 @ 10:00 AM

Title: Secure kNN Computation on Encrypted Databases


Author(s): Wai Kit Wong, David Wai-lok Cheung, Ben Kao, and Nikos Mamoulis

Available at:

Discussion Leader: Farhana Kabir

Friday, 28 January 2011 @ 10:00 AM

Title: Consistency Analysis in Bloom: a CALM and Collected Approach

CIDR 2011

Author(s): Peter Alvaro, Neil Conway, Joseph M. Hellerstein, and William R. Marczak

Available at:

Discussion Leader: David Maier

Friday, 21 January 2011 @ 10:00 AM

Title: G-Store: A Scalable Data Store for Transactional Multi key Access in the Cloud

SoCC 2010

Author(s): Sudipto Das, Divyakant Agrawal, and Amr El Abbadi

Available at:

Discussion Leader: David Chiu

Friday, 14 January 2011 @ 10:00 AM

Title: Analyzing the Energy Efficiency of a Database Server


Author(s): Dimitris Tsirogiannis, Stavros Harizopoulos, and Mehul A. Shah

Available at:

Discussion Leader: Kristin Tufte

Friday, 10 December 2010 @ 10:00 AM

Title: APEX: An Adaptive Path Index for XML Data


Author(s): Chin-Wan Chung, Jun-Ki Min, and Kyuseok Shim

Available at:

Discussion Leader: David Archer

Friday, 03 December 2010 @ 10:00 AM

Title: Horizontally Scalable Data Stores

Author(s): Rick Cattell

Available at:

Discussion Leader: Len Shapiro

Friday, 19 November 2010 @ 10:00 AM

Title: Large-scale Incremental Processing Using Distributed Transactions and Notifications

OSDI 2010

Author(s): Daniel Peng and Frank Dabek

Available at:

Discussion Leader: Scott Britell

Friday, 12 November 2010 @ 10:00 AM

Title: Continuous Subgraph Pattern Search over Certain and Uncertain Graph Streams

TKDE 22:8, August 2010

Author(s): Lei Chen and Changliang Wang

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 05 November 2010 @ 10:00 AM

Title: Capturing the Uncertainty of Moving-Object Representations

SSD 1999

Author(s): Dieter Pfoser and Christian S. Jensen

Available at:

Discussion Leader: Veronika Megler

Friday, 29 October 2010 @ 10:00 AM

Title: SECRET: A Model for Analysis of the Execution Semantics of Stream Processing Systems

VLDB 2010

Author(s): Irina Botan, Roozbeh Derakhshan, Nihal Dindar, Laura Haas, Renee J. Miller, and Nesime Tatbul

Available at:

Discussion Leader: David Maier

Friday, 22 October 2010 @ 3:00 PM

Title: UPI: A Primary Index for Uncertain Databases

VLDB 2010

Author(s): Hideaki Kimura, Samuel Madden, and Stanley B. Zdonik

Available at:

Discussion Leader: Patrick Leyshock

Friday, 15 October 2010 @ 10:00 AM

Title: Manimal: Relational Optimization for Data-Intensive Programs

WebDB 2010

Author(s): Michael J. Cafarella and Christopher Re

Available at:

Discussion Leader: Nick Rayner

Friday, 08 October 2010 @ 10:00 AM

Title: CORADD: Correlation Aware Database Designer for Materialized Views and Indexes

VLDB 2010

Author(s): Hideaki Kimura, George Huo, Alexander Rasin, Samuel Madden, and Stanley B. Zdonik

Available at:

Discussion Leader: Lois Delcambre

Friday, 03 September 2010 @ 10:00 AM

Title: Efficient Pattern Matching over Event Streams


Author(s): Jagrati Agrawal, Yanlei Diao, Daniel Gyllstrom, and Neil Immerman

Available at:

Discussion Leader: Amit Bhat

Friday, 27 August 2010 @ 10:00 AM

Title: Provenance-Based Refresh in Data-Oriented Workflows

Stanford InfoLab Technical Report, July 2010

Author(s): Robert Ikeda, Semih Salihoglu, and Jennifer Widom

Available at:

Discussion Leader: David Archer

Friday, 20 August 2010 @ 10:00 AM

Title: Online Aggregation


Author(s): Joseph M. Hellerstein, Peter J. Haas, and Helen J. Wang

Available at:

Discussion Leader: Len Shapiro

Friday, 13 August 2010 @ 10:00 AM

Title: An Extensible Test Framework for the Microsoft StreamInsight Query Processor

DBTest 2010

Author(s): Alex Raizman, Asvin Ananthanarayan, Anton Kirilov, Badrish Chandramouli, and Mohamed Ali

Available at:

Discussion Leader: Rafael J. Fernandez-Moctezuma

Friday, 06 August 2010 @ 10:00 AM

Title: F-Logic: A Higher-Order Language for Reasoning About Objects, Inheritance, and Scheme


Author(s): Michael Kifer and Georg Lausen

Available at:

Discussion Leader: David Maier

Friday, 30 July 2010 @ 10:00 AM

Title: Towards Adaptive, Flexible, and Self-tuned Database Systems

EPFL EDIC Research Proposal, July 2010

Author(s): Ioannis Alagiannis

Available at:

Discussion Leader: Dan Colish

Friday, 23 July 2010 @ 10:00 AM

Title: Querying in Highly Mobile Distributed Environments

VLDB 1992

Author(s): Tomasz Imielinski and B. R. Badrinath

Available at:

Discussion Leader: Veronika Megler

Friday, 16 July 2010 @ 10:00 AM

Title: A Transactional Model for Long-Running Activities

VLDB 1991

Author(s): Umeshwar Dayal, Meichun Hsu, and Rivka Ladin

Available at:

Discussion Leader: Patrick Leyshock

Friday, 09 July 2010 @ 10:00 AM

Title: BIRCH: An Efficient Data Clustering Method for Very Large Databases

SIGMOD Record, June 1996

Author(s): Tian Zhang, Raghu Ramakrishnan, and Miron Livny

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 02 July 2010 @ 10:00 AM

Title: Making Web Annotations Persistent over Time

JCDL 2010

Author(s): Robert Sanderson and Herbert Van de Sompel

Available at:

Discussion Leader: Lois Delcambre

Friday, 11 June 2010 @ 10:00 AM

Title: Transformation of Continuous Aggregation Join Queries over Data Streams

SSTD 2007

Author(s): Tri Minh Tran and Byung Suk Lee

Available at:

Discussion Leader: Rafael J. Fernandez-Moctezuma

Friday, 04 June 2010 @ 10:00 AM

Title: A Statistical Comparison of Tag and Query Logs

SIGIR 2009

Author(s): Mark J. Carman, Mark Baillie, Robert Gwadera, and Fabio Crestani

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 28 May 2010 @ 10:00 AM

Title: CodeQuest: Scalable Source Code Queries with Datalog

ECOOP 2006

Author(s): Elnar Hajiyev, Mathieu Verbaere, and Oege de Moor

Available at:

Discussion Leader: Nick Rayner

Friday, 21 May 2010 @ 10:00 AM

Title: JouleSort: A Balanced Energy-Efficient Benchmark


Author(s): Suzanne Rivoire, Mehul Shah, Parthasarathy Ranganathan, and Christos Kozyrakis

Available at:

Discussion Leader: Len Shapiro

Friday, 30 April 2010 @ 10:00 AM

Title: A Graph Model of Data and Workflow Provenance

TaPP 2010

Author(s): Umut Acar, Peter Buneman, James Cheney, Jan Van den Bussche, Natalia Kwasnikowska, and Stijn Vansummeren

Available at:

Discussion Leader: David Archer

Friday, 23 April 2010 @ 10:00 AM

Title: Composition and Inversion of Schema Mappings

SIGMOD Record, September 2009

Author(s): Marcelo Arenas, Jorge Perez, Juan Reutter, and Cristian Riveros

Available at:

Discussion Leader: Dan Colish

Friday, 16 April 2010 @ 10:00 AM

Title: Exploiting Predicate-Window Semantics over Data Streams

SIGMOD Record, March 2006

Author(s): Thanaa M. Ghanem, Walid G. Aref, and Ahmed K. Elmagarmid

Available at:

Discussion Leader: Amit Bhat

Friday, 09 April 2010 @ 10:00 AM

Title: A General Datalog-Based Framework for Tractable Query Answering over Ontologies

PODS 2009

Author(s): Andrea Cali, Georg Gottlob, and Thomas Lukasiewicz

Available at:

Discussion Leader: David Maier

Friday, 19 March 2010 @ 10:00 AM

Title: USHER: Improving Data Quality with Dynamic Forms

ICDE 2010

Author(s): Kuang Chen, Harr Chen, Neil Conway, Joseph M. Hellerstein, and Tapan S. Parikh

Available at:

Discussion Leader: Rafael J. Fernandez-Moctezuma

Friday, 12 March 2010 @ 10:00 AM

Title: Semantic Representation of Context Models: a Framework for Analyzing and Understanding

CIAO 2009

Author(s): Salma Najar, Oumaima Saidani, Manuele Kirsch-Pinheiro, Carine Souveyet, and Selmen Nurcan

Available at:

Discussion Leader: Nick Rayner

Friday, 05 March 2010 @ 10:00 AM

Title: A Conceptual View on Trajectories

Data & Knowledge Engineering 65:126-46, 2008

Author(s): S. Spaccapietra, C. Parent, M. Damiani, J. Macedo, F. Porta, and C. Vangenot

Available at:

Discussion Leader: Veronika Megler

Friday, 26 February 2010 @ 10:00 AM

Title: Object Reuse & Exchange: A Resource-Centric Approach

The Computing Research Repository April 2008

Author(s): Carl Lagoze, Herbert Van de Sompel, Michael L. Nelson, Simeon Warner, Robert Sanderson, and Pete Johnston

Available at:

Discussion Leader: Scott Britell

Friday, 19 February 2010 @ 10:00 AM

Title: Declarative Support for Sensor Data Cleaning

International Conference on Pervasive Computing 2006

Author(s): Shawn R. Jeffery, Gustavo Alonso, Michael J. Franklin, Wei Hong, and Jennifer Widom

Available at:

Discussion Leader: Len Shapiro

Friday, 12 February 2010 @ 10:00 AM

Title: A Framework for Semantic Link Discovery over Relational Data

CIKM 2009

Author(s): Oktie Hassanzadeh, Anastasios Kementsietsidis, Lipyeow Lim, Renee J. Miller, and Min Wang

Available at:

Discussion Leader: Dan Colish

Friday, 05 February 2010 @ 10:00 AM

Title: Evolving Objects in Temporal Information Systems

Annals of Mathematics and Artificial Intelligence, June 2007

Author(s): Alessandro Artale, Christine Parent, and Stefano Spaccapietra

Available at:

Discussion Leader: David Archer

Friday, 29 January 2010 @ 10:00 AM

Title: A Session Based Personalized Search Using an Ontological User Profile

SAC 2009

Author(s): Mariam Daoud, Lynda Tamine-Lechani, Mohand Boughanem, and Bilal Chebaro

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 22 January 2010 @ 10:00 AM

Title: Anchor Modeling

ER 2009

Author(s): O. Regardt, L. Ronnback, M. Bergholtz, P. Johannesson, and P. Wohed

Available at:

Discussion Leader: Lois Delcambre

Friday, 15 January 2010 @ 10:00 AM

Title: Impact of Disk Corruption on Open-Source DBMS

ICDE 2010

Author(s): S. Subramanian, Y. Zhang, R. Vaidyanathan, H. S. Gunawi, A. C. Arpaci-Dusseau, R. H. Arpaci-Dusseau, and J. F. Naughton

Available at:

Discussion Leader: David Maier

Friday, 11 December 2009 @ 10:00 AM

Title: Characteristic Relational Patterns

KDD '09

Author(s): Arne Koopman and Arno Siebes

Available at:

Discussion Leader: Nick Rayner

Friday, 04 December 2009 @ 10:00 AM

Title: Understanding the Semantics of Data Provenance to Support Active Conceptual Modeling

Author(s): Sudha Ram and Jun Liu

Available at:

Discussion Leader: David Archer

Friday, 20 November 2009 @ 10:00 AM

Title: Typed Datalog

PADL '09

Author(s): David Zook, Emir Pasalic, and Beata Sarna-Starosta

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: David Maier

Friday, 13 November 2009 @ 10:00 AM

Title: A Comparison of Approaches to Large-Scale Data Analysis


Author(s): Andrew Pavlo, Erik Paulson, Alexander Rasin, Daniel J. Abadi, David J. DeWitt, Samuel Madden, and Michael Stonebraker

Available at:

Discussion Leader: Veronika Megler

Friday, 06 November 2009 @ 11:30 AM

Title: Information Scraps: How and Why Information Eludes our Personal Information Management Tools

TOIS 26:4, September 2008

Author(s): Michael Bernstein, Max Van Kleek, David Karger, and M. C. Schraefel

Available at:

Discussion Leader: Amit Bhat

Friday, 30 October 2009 @ 10:00 AM

Title: Stream Warehousing with DataDepot


Author(s): Lukasz Golab, Theodore Johnson, J. Spencer Seidel, and Vladislav Shkapenyuk

Available at:

Discussion Leader: Rafael J. Fernandez-Moctezuma

Friday, 23 October 2009 @ 10:00 AM

Title: Declarative Support for Sensor Data Cleaning

ICPC '06

Author(s): Shawn R. Jeffery, Gustavo Alonso, Michael J. Franklin, Wei Hong, and Jennifer Widom

Available at:

Discussion Leader: Len Shapiro

Friday, 16 October 2009 @ 10:00 AM

Title: Semantics and Implementation of Continuous Sliding Window Queries Over Data Streams

TODS 34:1, April 2009

Author(s): Jurgen Kramer and Bernhard Seeger

Available at:

Discussion Leader: David Maier

Friday, 09 October 2009 @ 10:00 AM

Title: Reasonable Tag-based Collaborative Filtering for Social Tagging Systems


Author(s): Reyn Y. Nakamoto, Shinsuke Nakajima, Jun Miyazaki, Shunsuke Uemura, Hirokazu Kato, and Youichi Inagaki

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 11 September 2009 @ 10:00 AM

Title: Prefilter: Predicate Pushdown at Streaming Speeds

SSPS '08

Author(s): Lukasz Golab, Theodore Johnson, and Oliver Spatscheck

Available at:

Discussion Leader: Amit Bhat

Friday, 21 August 2009 @ 10:00 AM

Title: Adaptive Control of Extreme-Scale Stream Processing Systems

CDCS '06

Author(s): Lisa Amini, Navendu Jain, Anshul Sehgal, Jeremy Silber, and Olivier Verscheure

Available at:

Discussion Leader: Veronika Megler

Friday, 07 August 2009 @ 10:00 AM

Title: GDM: A New Graph Based Data Model Using Functional Abstractionx [sic]

J. Comput. Sci. & Technol., May 2006

Author(s): Sankhayan Choudhury, Nabendu Chaki, and Swapan Bhattacharya

Available at:

Discussion Leader: David Archer

Friday, 29 May 2009 @ 10:00 AM

Title: Automatic Verification of Database-Driven Systems: A New Frontier

ICDT 2009

Author(s): Victor Vianu

Available at:

Discussion Leader: Nick Rayner

Friday, 22 May 2009 @ 10:00 AM

Title: Combating Spam in Tagging Systems: An Evaluation

ACM Transactions on the Web 2(3), October 2008

Author(s): Georgia Koutrika, Frans Adjie Effendi, Zoltan Gyongyi, Paul Heymann, and Hector Garcia-Molina

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 08 May 2009 @ 10:00 AM

Title: Containment of Conjunctive Queries on Annotated Relations

ICDT 2009

Author(s): Todd J. Green

Available at:

Discussion Leader: David Archer

Friday, 01 May 2009 @ 10:00 AM

Title: Scalable Regular Expression Matching on Data Streams


Author(s): Anirban Majumder, Rajeev Rastogi, and Sriram Vanama

Available at:

Discussion Leader: Rafael J. Fernandez-Moctezuma

Friday, 24 April 2009 @ 10:00 AM

Title: Recursive Computation of Regions and Connectivity in Networks

ICDE 2009

Author(s): Mengmeng Liu, Nicholas Taylor, Wenchao Zhou, Zachary Ives, and Boon Thau Loo

Available at:

Discussion Leader: David Maier

Friday, 17 April 2009 @ 10:00 AM

Title: Transformation-based Framework for Record Matching

ICDE 2008

Author(s): Arvind Arasu, Surajit Chaudhuri, and Raghav Kaushik

Available at:

Discussion Leader: Len Shapiro

Friday, 10 April 2009 @ 11:00 AM

Title: Fedora: An Architecture for Complex Objects and their Relationships

International Journal on Digital Libraries 6(2), April 2006

Author(s): Carl Lagoze, Sandy Payette, Edwin Shin, and Chris Wilper

Available at:

Discussion Leader: Lois Delcambre

Friday, 10 April 2009 @ 10:00 AM

Title: NCORE: Architecture and Implementation of a Flexible, Collaborative Digital Library


Author(s): Dean B. Krafft, Aaron Birkland, and Ellen J. Cramer

Available at:

Discussion Leader: Lois Delcambre

Friday, 13 March 2009 @ 10:00 AM

Title: SCADS: Scale-Independent Storage for Social Computing Applications

CIDR 2009

Author(s): Michael Armbrust, Armando Fox, David Patterson, Nick Lanham, Beth Trushkowsky, Jesse Trutna, and Haruki Oh

Available at:

Discussion Leader: Len Shapiro

Friday, 06 March 2009 @ 10:00 AM

Title: A SQL Database System for Solving Constraints


Author(s): Sebastien Siva and Lesi Wang

Available at:

Discussion Leader: Nick Rayner

Friday, 27 February 2009 @ 10:00 AM

Title: ViP: A User-Centric View-Based Annotation Framework for Scientific Data

SSDBM 2008

Author(s): Qinglan Li, Alexandros Labrinidis, and Panos K. Chrysanthis

Available at:

Discussion Leader: David Archer

Friday, 20 February 2009 @ 11:00 AM

Title: Eventually Consistent

Communications of the ACM, January 2009

Author(s): Werner Vogels

Available at:

Discussion Leader: Rafael J. Fernandez-Moctezuma

Friday, 20 February 2009 @ 10:00 AM

Title: Google's Deep Web Crawl

VLDB 2008

Author(s): Jayant Madhavan, David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, and Alon Halevy

Available at:

Discussion Leader: Len Shapiro

Friday, 13 February 2009 @ 10:00 AM

Title: RIOT: I/O-Efficient Numerical Computing without SQL

CIDR 2009

Author(s): Yi Zhang, Herodotos Herodotou, and Jun Yang

Available at:

Discussion Leader: David Maier

Friday, 06 February 2009 @ 10:00 AM

Title: A Unified and Discriminative Model for Query Refinement


Author(s): Jiafeng Guo, Gu Xu, Hang Li, and Xueqi Cheng

Available at:

Discussion Leader: Nick Rayner

Friday, 30 January 2009 @ 10:00 AM

Title: Capturing Data Uncertainty in High-Volume Stream Processing

CIDR 2009

Author(s): Yanlei Diao, Boduo Li, Anna Liu, Liping Peng, Charles Sutton, Thanh Tran, and Michael Zink

Available at:

Discussion Leader: Rafael J. Fernandez-Moctezuma

Friday, 23 January 2009 @ 10:00 AM

Title: Graph Database Indexing Using Structured Graph Decomposition

ICDE 2007

Author(s): David W. Williams, Jun Huan, and Wei Wang

Available at:

Discussion Leader: David Archer

Friday, 16 January 2009 @ 10:00 AM

Title: Data Management for High-Throughput Genomics

CIDR 2009

Author(s): Uwe Rohm and Jose A. Blakeley

Available at:

Discussion Leader: David Maier

Friday, 05 December 2008 @ 10:00 AM

Title: On the Expressiveness of Implicit Provenance in Query and Update Languages

ICDT 2007

Author(s): P. Buneman, J. Cheney, S. Vansummeren

Available at:

Discussion Leader: David Archer

Friday, 21 November 2008 @ 10:00 AM

Title: Interactive Paper as a Reading Medium in Digital Libraries

ECDL 2008

Author(s): M. Norrie, B. Signer, N. Weibel

Available at:

Discussion Leader: Jeremy Steinhauer

Friday, 14 November 2008 @ 10:00 AM

Title: Integrating Urban Form and Demographics in Water Demand Management: An Empirical Case Study of Portland, Oregon (USA)

Author(s): Vivek Shandas, G. Hossein Parandavash

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Len Shapiro

Friday, 07 November 2008 @ 10:00 AM

Title: Clustera: An Integrated Computation and Data Management System

VLDB '08

Author(s): David J. DeWitt, Erik Paulson, Eric Robinson, et al.

Available at:

Discussion Leader: Kristin Tufte

Friday, 31 October 2008 @ 10:00 AM

Title: Schema Mapping Verification: The Spicy Way

EDBT '08

Author(s): A. Bonifati, G. Mecca, A. Pappalardo, et al.

Available at:

Discussion Leader: Nick Rayner

Friday, 24 October 2008 @ 10:00 AM

Title: Towards a Streaming SQL Standard

VLDB '08

Author(s): Namit Jain, Johannes Gehrke, Jennifer Widom, et al.

Available at:

Discussion Leader: Rafael J. Fernandez-Moctezuma

Friday, 10 October 2008 @ 10:00 AM

Title: Flying Fixed-Point: Recursive Processing in Stream Queries

Author(s): Jonathan Goldstein and David Maier

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: David Maier

Friday, 15 August 2008 @ 10:00 AM

Title: Flexible and Efficient IR Using Array Databases

The VLDB Journal, January 2008

Author(s): Roberto Cornacchia, Sandor Heman, Marcin Zukowski, Arjen P. de Vries, Peter Boncz

Available at:

Discussion Leader: Dave Maier

Friday, 08 August 2008 @ 10:00 AM

Title: Pay-as-You-Go User Feedback for Dataspace Systems


Author(s): Shawn R. Jeffery, Michael J. Franklin, and Alon Y. Halevy

Available at:

Discussion Leader: Rafael J. Fernandez-Moctezuma

Friday, 01 August 2008 @ 10:00 AM

Title: Provenance Management in Curated Databases


Author(s): Peter Buneman, Adriane Chapman, and James Cheney

Available at:

Discussion Leader: David Archer

Friday, 25 July 2008 @ 10:00 AM

Title: Bootstrapping Pay-as-You-Go Data Integration Systems


Author(s): Anish Das Sarma, Xin Dong, and Alon Halevy

Available at:

Discussion Leader: Nick Rayner

Friday, 18 July 2008 @ 10:00 AM

Title: Extreme Visualization: Squeezing a Billion Records into a Million Pixels


Author(s): Ben Shneiderman

Available at:

Discussion Leader: Len Shapiro

Friday, 11 July 2008 @ 10:00 AM

Title: A Visual Environment for Dynamic Web Application Composition

Proceedings of the Fourteenth ACM Conference on Hypertext and Hypermedia (HYPERTEXT '03)

Author(s): Kimihito Ito and Yuzuru Tanaka

Available at:

Discussion Leader: Lois Delcambre

Friday, 13 June 2008 @ 10:00 AM

Title: GPUTeraSort: High Performance Graphics Coprocessor Sorting for Large Database Management


Author(s): Naga K. Govindaraju, Jim Gray, Ritesh Kumar, and Dinesh Manocha

Available at:

Discussion Leader: Len Shapiro

Friday, 06 June 2008 @ 10:00 AM

Title: A Security Punctuation Framework for Enforcing Access Control on Streaming Data

ICDE '08

Author(s): Rimma V. Nehme, Elke A. Rundensteiner, and Elisa Bertino

Available at:

Discussion Leader: Dave Maier

Friday, 30 May 2008 @ 10:00 AM

Title: Sketching Probabilistic Data Streams


Author(s): Graham Cormode and Minos Garofalakis

Available at:

Discussion Leader: Rafael J. Fernández-Moctezuma

Friday, 23 May 2008 @ 10:00 AM

Title: Sideways Information Passing for Push-Style Query Processing

ICDE '08

Author(s): Zachary G. Ives and Nicholas E. Taylor

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 09 May 2008 @ 10:00 AM

Title: Update Exchange with Mappings and Provenance

VLDB '07

Author(s): Green, T., Karvounarakis, G., Ives, Z., Tannen, V.

Available at:

Discussion Leader: David Archer

Friday, 02 May 2008 @ 10:00 AM

Title: From Dirt to Shovels: Fully Automatic Tool Generation from Ad Hoc Data

POPL '08

Author(s): Kathleen Fisher, David Walker, Kenny Q. Zhu, and Peter White

Available at:

Discussion Leader: Nick Rayner

Friday, 25 April 2008 @ 10:00 AM

Title: Query-Aware Partitioning for Monitoring Massive Network Data Streams


Author(s): Vladislav Shkapenyuk, Ted Johnson, Oliver Spatscheck, and S. Muthukrishnan

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Jin Li

Friday, 18 April 2008 @ 10:00 AM

Title: Bridging the Application and DBMS Profiling Divide for Database Application Developers

VLDB '07

Author(s): Surajit Chaudhuri, Vivek Narasayya, and Manoj Syamala

Available at:

Discussion Leader: James Terwilliger

Friday, 11 April 2008 @ 10:00 AM

Title: Column-Stores vs. Row-Stores: How Different Are They Really?


Author(s): D. Abadi, S. Madden, N. Hachem

Available at:

Discussion Leader: Kristin Tufte

Friday, 04 April 2008 @ 10:00 AM

Title: Web 3.0: Chicken Farms on the Semantic Web

Author(s): Jim Hendler

Available at:

Discussion Leader: Len Shapiro

Friday, 04 April 2008 @ 10:00 AM

Title: Paper-choosing session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone

Friday, 07 March 2008 @ 10:00 AM

Title: Continuous Queries in Oracle

VLDB '07

Author(s): Andrew Witkowski, Srikanth Bellamkonda, Hua-Gang Li, Vince Liang, Lei Sheng, Wayne Smith, Sankar Subramanian, James Terry, Tsae-Feng Yu

Available at:

Discussion Leader: Jin Li

Friday, 29 February 2008 @ 10:00 AM

Title: Database Virtualization: A New Frontier for Database Tuning and Physical Design

ICDE '07

Author(s): Soror, Ahmed A.; Aboulnaga, Ashraf; Salem, Kenneth

Available at:

Discussion Leader: Len Shapiro

Friday, 22 February 2008 @ 10:00 AM

Title: Cooperative scans: dynamic bandwidth sharing in a DBMS

VLDB '07

Author(s): Marcin Zukowski, Sandor Heman, Niels Nes, and Peter Boncz

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 15 February 2008 @ 10:00 AM

Title: Autonomously Semantifying Wikipedia

CIKM '07

Author(s): Fei Wu and Daniel Weld

Available at:

Discussion Leader: Susan Price

Friday, 08 February 2008 @ 10:00 AM

Title: Entity Resolution with Markov Logic

ICDM '06

Author(s): Parag Singla and Pedro Domingos

Available at:

Discussion Leader: Nick Rayner

Friday, 01 February 2008 @ 10:00 AM

Title: Making database systems usable


Author(s): H. V. Jagadish et al.

Available at:

Discussion Leader: James Terwilliger

Friday, 25 January 2008 @ 10:00 AM

Title: Consistency Sensitive Operators in CEDR

Author(s): Jonathan Goldstein, Mingsheng Hong, Mohamed Ali, and Roger Barga

Available at:

Discussion Leader: Rafael Fernandez

Friday, 18 January 2008 @ 10:00 AM

Title: A Formal Characterization of PIVOT/UNPIVOT

CIKM '05

Author(s): Wyss, C., Robertson, E.

Available at:

Discussion Leader: David Archer

Friday, 11 January 2008 @ 10:00 AM

Title: Paper-choosing session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone

Friday, 07 December 2007 @ 10:00 AM

Title: Efficient Use of the Query Optimizer for Automated Physical Design

VLDB '07

Author(s): Stratos Papadomanolakis, Debabrata Dash, and Anastasia Ailamaki

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 30 November 2007 @ 10:00 AM

Title: Conditional Functional Dependencies for Data Cleaning

ICDE '07

Author(s): Philip Bohannon, Wenfei Fan, Floris Geerts, Xibei Jia, and Anastasios Kementsietsidis

Available at:

Discussion Leader: Jin Li

Friday, 16 November 2007 @ 10:00 AM

Title: Similarity Search: A Matching Based Approach

VLDB '06

Author(s): Anthony K. H. Tung, Rui Zhang, Nick Koudas, and Beng Chin Ooi

Available at:

Discussion Leader: Rafael Fernandez

Friday, 02 November 2007 @ 10:00 AM

Title: FICSR: Feedback-based InConSistency Resolution and query processing


Author(s): Yan Qi, K. Selcuk Candan, and Maria Luisa Sapino

Available at:

Discussion Leader: Nick Rayner

Friday, 26 October 2007 @ 10:00 AM

Title: Annotation Strategy

Author(s): Denise Bedford

Available at:

Discussion Leader: David Archer

Friday, 19 October 2007 @ 10:00 AM

Title: The Making of TPC-DS

VLDB '06

Author(s): Ragunath Othayoth and Meikel Poess

Available at:

Discussion Leader: Len Shapiro

Friday, 12 October 2007 @ 10:00 AM

Title: A Relational Approach to Incrementally Extracting and Querying Structure in Unstructured Data

VLDB '07

Author(s): Eric Chu, Akanksha Baid, Ting Chen, AnHai Doan, and Jeffrey F. Naughton

Available at:

Discussion Leader: David Maier

Friday, 05 October 2007 @ 10:00 AM

Title: Teaching a Schema Translator to Produce O/R Views

Author(s): Peter Mork, Philip A. Bernstein, and Sergey Melnik

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: James Terwilliger

Friday, 28 September 2007 @ 11:00 AM

Title: Paper-choosing session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone

Friday, 14 September 2007 @ 10:00 AM

Title: Scaling Games to Epic Proportions


Author(s): Walker White, Alan Demers, Christoph Koch, Johannes Gehrke, and Rajmohan Rajagopalan

Available at:

Discussion Leader: Dave Maier

Friday, 07 September 2007 @ 10:00 AM

Title: Extracting entity profiles from semistructured information spaces

SIGMOD Rec. 26, 4 (Dec. 1997), 32-38

Author(s): Nado, R. A. and Huffman, S. B.

Available at:

Discussion Leader: David Archer

Friday, 24 August 2007 @ 10:00 AM

Title: Intensional Associations Between Data and Metadata


Author(s): Divesh Srivastava and Yannis Velegrakis

Available at:

Discussion Leader: Nick Rayner

Friday, 17 August 2007 @ 9:30 AM

Title: The Complex Dynamics of Collaborative Tagging

WWW 2007

Author(s): Harry Halpin, Valentin Robu, and Hana Shepherd

Available at:

Discussion Leader: Susan Price

Friday, 10 August 2007 @ 10:00 AM

Title: Scaling Up All Pairs Similarity Search

WWW 2007

Author(s): Roberto J. Bayardo, Yiming Ma, and Ramakrishnan Srikant

Available at:

Discussion Leader: Rafael Fernandez

Friday, 03 August 2007 @ 10:00 AM

Title: In-Memory Grid Files on Graphics Processors

DaMoN '07

Author(s): Ke Yang, Bingsheng He, Rui Fang, Mian Lu, Naga Govindaraju, Qiong Luo, Pedro Sander, and Jiaoying Shi

Available at:

Discussion Leader: Kristin Tufte

Friday, 27 July 2007 @ 9:00 AM

Title: Map-reduce-merge: simplified relational data processing on large clusters


Author(s): Hung-chih Yang, Ali Dasdan, Ruey-Lung Hsiao, and D. Stott Parker

Available at:

Discussion Leader: Len Shapiro

Friday, 20 July 2007 @ 10:00 AM

Title: iTrails: Pay-as-you-go Information Integration in Dataspaces

VLDB '07

Author(s): Marcos Antonio Vaz Salles, Jens-Peter Dittrich, Shant Kirakos Karakashian, Olivier René Girard, and Lukas Blunschi

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Dave Maier

Friday, 13 July 2007 @ 10:00 AM

Title: The five-minute rule twenty years later, and how flash memory changes the rules

DaMoN '07

Author(s): Goetz Graefe

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 15 June 2007 @ 10:00 AM

Title: Sharing Aggregate Computation for Distributed Queries


Author(s): Ryan Huebsch, Minos Garofalakis, Joseph M. Hellerstein, and Ion Stoica

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 08 June 2007 @ 10:00 AM

Title: Personalized Queries under a Generalized Preference Model

ICDE '05

Author(s): Georgia Koutrika and Yannis Ioannidis

Available at:

Discussion Leader: Len Shapiro

Friday, 01 June 2007 @ 10:00 AM

Title: Toward Entity Retrieval over Structured and Text Data

WIRD '04

Author(s): Sayyadian, M., Shakery, A., Doan, A., Zhai, C.

Available at:

Discussion Leader: David Archer

Friday, 25 May 2007 @ 10:00 AM

Title: MoBIoS: a Metric-Space DBMS to Support Biological Discovery


Author(s): Daniel Miranker, Weijia Xu, and Rui Mao

Available at:

Discussion Leader: David Maier

Friday, 18 May 2007 @ 10:00 AM

Title: Efficient Reverse k-Nearest Neighbor Search in Arbitrary Metric Spaces


Author(s): Elke Achtert, Christian Bohm, Peer Kroger, Peter Kunath, Alexey Pryakhin, and Matthias Renz

Available at:

Discussion Leader: Rafael Fernandez

Friday, 11 May 2007 @ 10:00 AM

Title: Safety guarantee of continuous join queries over punctuated data streams

VLDB '06

Author(s): Hua-Gang Li, Songting Chen, Junichi Tatemura, Divyakant Agrawal, K. Selcuk Candan, and Wang-Pin Hsiung

Available at:

Discussion Leader: Jin Li

Friday, 04 May 2007 @ 10:00 AM

Title: Realizing Parallelism in Database Operations: Insights from a Massively Multithreaded Architecture

Author(s): John Cieslewicz, Jonathan Berry, Bruce Hendrickson, Kenneth A. Ross

Available at:

Discussion Leader: Kristin Tufte

Friday, 27 April 2007 @ 10:00 AM

Title: Compiling Mappings to Bridge Applications and Databases


Author(s): Sergey Melnik, Atul Adya, and Phil Bernstein

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: James Terwilliger

Friday, 20 April 2007 @ 10:00 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Susan Price

Friday, 13 April 2007 @ 10:00 AM

Title: A Relational View of the Semantic Web

Author(s): Andrew Newman

Available at:

Discussion Leader: Lois Delcambre

Friday, 13 April 2007 @ 10:00 AM

Title: SPARQL Query Language for RDF

Author(s): W3C Candidate Recommendation

Available at:

Discussion Leader: Lois Delcambre

Friday, 23 March 2007 @ 10:00 AM

Title: SEMEX: Toward On-the-fly Personal Information Integration

VLDB IIWeb Workshop 2004

Author(s): Dong, X., Halevy, A., Nemes, E., Sigurdsson, S., Domingos, P.

Available at:

Discussion Leader: David Archer

Friday, 09 March 2007 @ 10:00 AM

Title: Declarative networking: language, execution and optimization


Author(s): Boon Thau Loo, Tyson Condie, Minos Garofalakis, David E. Gay, Joseph M. Hellerstein, Petros Maniatis, Raghu Ramakrishnan, Timothy Roscoe, and Ion Stoica

Available at:

Discussion Leader: David Maier

Friday, 02 March 2007 @ 10:00 AM

Title: Consistent Streaming Through Time: A Vision for Event Stream Processing

CIDR '07

Author(s): Roger Barga, Jonathan Goldstein, Mohamed Ali, and Mingsheng Hong

Available at:

Discussion Leader: Jin Li

Friday, 23 February 2007 @ 10:00 AM

Title: A Heterogeneous Field Matching Method for Record Linkage

ICDM '05

Author(s): S. Minton, C. Nanjo, C. A. Knoblock, M. Michalowski, and M. Michelson

Available at:

Discussion Leader: Nick Rayner

Friday, 16 February 2007 @ 10:00 AM

Title: Life Beyond Distributed Transactions: An Apostate's Opinion

CIDR '07

Author(s): Pat Helland

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 09 February 2007 @ 10:00 AM

Title: A language modeling approach to information retrieval


Author(s): Jay M. Ponte, W. Bruce Croft

Available at:

Discussion Leader: Susan Price

Friday, 02 February 2007 @ 10:00 AM

Title: Community-Driven Ontology Matching

ESWC 2006

Author(s): Zhdanova, A., Shvaiko, P.

Available at:

Discussion Leader: Lois Delcambre

Friday, 26 January 2007 @ 10:00 AM

Title: Moirae: History-Enhanced Monitoring

CIDR '07

Author(s): Magdalena Balazinska, YongChul Kwon, Nathan Kuchta, and Dennis Lee

Available at:

Discussion Leader: Kristin Tufte

Friday, 19 January 2007 @ 10:00 AM

Title: Relational Lenses: A Language for Updatable Views

PODS '06

Author(s): Aaron Bohannon, Benjamin Pierce, and Jeffrey A. Vaughan

Available at:

Discussion Leader: James Terwilliger

Friday, 12 January 2007 @ 10:00 AM

Title: Paper-choosing session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader:

Friday, 08 December 2006 @ 10:00 AM

Title: Using ECA Rules to Implement Mobile Query Agents for Fast-Evolving Pure P2P Database Systems

MDM '06

Author(s): Verena Kantere and Aris Tsois

Available at:

Discussion Leader: Rafael J. Fernandez-Moctezuma

Friday, 01 December 2006 @ 10:00 AM

Title: Sudokus as Logical Puzzles


Author(s): Thomas Hillenbrand, Dalibor Topic, and Christoph Weidenbach

Available at:

Discussion Leader: Nick Rayner

Friday, 17 November 2006 @ 10:00 AM

Title: Textpresso: An Ontology-Based Information Retrieval and Extraction System for Biological Literature

PLoS Biology

Author(s): Hans-Michael Muller, Eimear E. Kenny, and Paul W. Sternberg

Available at:

Discussion Leader: David Maier

Friday, 03 November 2006 @ 10:00 AM

Title: Magnet: Supporting Navigation in Semistructured Data Environments

Author(s): Sinha, V., Karger, D.

Available at:

Discussion Leader: David Archer

Friday, 27 October 2006 @ 10:00 AM

Title: Modeling Skew in Data Streams


Author(s): Flip Korn, S. Muthukrishnan, and Yihua Wu

Available at:

Discussion Leader: Jin Li

Friday, 20 October 2006 @ 10:00 AM

Title: User performance versus precision measures for simple search tasks


Author(s): Andrew Turpin and Falk Scholer

Available at:

Discussion Leader: Susan Price

Friday, 06 October 2006 @ 10:00 AM

Title: User-defined aggregate functions: bridging theory and practice


Author(s): Sara Cohen

Available at:

Discussion Leader: James Terwilliger

Friday, 15 September 2006 @ 10:00 AM

Title: Privacy enhancing identity management: protection against re-identification and profiling

Author(s): Sebastian Clauss, Dogan Kesdogan, Tobias Kolsch

Available at:

Discussion Leader: Nick Rayner

Friday, 08 September 2006 @ 10:00 AM

Title: On redundancy vs dependency preservation in normalization: an information-theoretic study of 3NF

PODS '06

Author(s): Solmaz Kolahi and Leonid Libkin

Available at:

Discussion Leader: David Maier

Friday, 01 September 2006 @ 10:00 AM

Title: Personalizing Electronic Books

Author(s): Ohene-Djan, James, and Fernandes, Alvaro A. A.

Available at:

Discussion Leader: David Archer

Friday, 25 August 2006 @ 10:00 AM

Title: To Search or to Crawl? Towards a Query Optimizer for Text-Centric Tasks


Author(s): P. Ipeirotis, E. Agichtein, P. Jain, L. Gravano

Available at:

Discussion Leader: Laura Bright

Friday, 18 August 2006 @ 10:00 AM

Title: Buffer Pool Aware Query Optimization

CIDR '05

Author(s): Ravishankar Ramamurthy and David J. DeWitt

Available at:

Discussion Leader: Len Shapiro

Friday, 11 August 2006 @ 10:00 AM

Title: How to cite curated databases and how to make them citable

Author(s): Peter Buneman

Available at:

Discussion Leader: Bill Howe

Friday, 04 August 2006 @ 10:00 AM

Title: Window-aware Load Shedding for Aggregation Queries over Data Streams

VLDB '06

Author(s): Nesime Tatbul and Stan Zdonik

Available at:

Discussion Leader: Jin Li

Friday, 28 July 2006 @ 10:00 AM

Title: Relaxed Currency Serializability for Middle-Tier Caching and Replication


Author(s): P. A. Bernstein, A. Fekete, H. Guo, R. Ramakrishnan, P. Tamma

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 21 July 2006 @ 10:00 AM

Title: L-Diversity: Privacy Beyond k-Anonymity

ICDE '06

Author(s): Machanavajjhala, Gehrke, Kifer, Venkitasubramaniam

Available at:

Discussion Leader: James Terwilliger

Friday, 14 July 2006 @ 10:00 AM

Title: Incremental Test Collections

CIKM '05

Author(s): Ben Carterette and James Allan

Available at:

Discussion Leader: Susan Price

Friday, 07 July 2006 @ 10:00 AM

Title: Paper-choosing session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone

Friday, 16 June 2006 @ 10:00 AM

Title: RAM: Array Processing over a Relational DBMS

CWI Tech. Report

Author(s): A. R. van Ballegooij, A. P. de Vries, M. L. Kersten

Available at:

Discussion Leader: Bill Howe

Friday, 09 June 2006 @ 10:00 AM

Title: Cracking the Database Store

CIDR '05

Author(s): Martin Kersten and Stefan Manegold

Available at:

Discussion Leader: David Maier

Friday, 02 June 2006 @ 10:00 AM

Title: Supporting Exploratory Search (Communications of the ACM Special Issue)

Author(s): Ryen W. White, Bill Kules, Steven M. Drucker, and M. C. Schraefel (eds.)

Available at:

Discussion Leader: Susan Price

Friday, 26 May 2006 @ 10:00 AM

Title: Clean Answers Over Dirty Databases

ICDE '06

Author(s): Periklis Andritsos, Ariel Fuxman, and Renee J. Miller

Available at:

Discussion Leader: James Terwilliger

Friday, 19 May 2006 @ 11:00 AM

Title: Declarative Querying for Biological Sequences

ICDE 2006

Author(s): S. Tata, J. Patel, J. Friedman, A. Swaroop

Available at:

Discussion Leader: Laura Bright

Friday, 12 May 2006 @ 10:00 AM

Title: Tackling inconsistencies in data integration through source preferences

Author(s): G. De Giacomo, D. Lembo, M. Lenzerini, R. Rosati

Available at:

Discussion Leader: Nick Rayner

Friday, 05 May 2006 @ 10:00 AM

Title: Transaction Time Support Inside a Database Engine

ICDE 2006

Author(s): David Lomet, Roger Barga, Mohamed F. Mokbel, German Shegalov, Rui Wang, Yunyue Zhu

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 28 April 2006 @ 10:00 AM

Title: Updates Through Views: A New Hope

Author(s): Yannis Kotidis, Divesh Srivastava, and Yannis Velegrakis

Available at:

Discussion Leader: David Archer

Friday, 21 April 2006 @ 10:00 AM

Title: Declarative Network Monitoring with an Underprovisioned Query Processor

ICDE '06

Author(s): Frederick Reiss and Joseph Hellerstein

Available at:

Discussion Leader: Jin Li

Friday, 14 April 2006 @ 10:00 AM

Title: Interconnections in multi-core architectures: Understanding Mechanisms, Overheads and Scaling

Author(s): Rakesh Kumar, Victor Zyuban, Dean Tullsen

Available at:

Discussion Leader: Kristin Tufte

Friday, 07 April 2006 @ 10:00 AM

Title: Why Do Computers Stop and What Can Be Done About It

Author(s): J. Gray

Available at:

Discussion Leader: David Maier

Friday, 24 March 2006 @ 10:00 AM

Title: A Report on the NSF-Sponsored Workshop on Personal Information Management, Seattle, WA, 2005

Author(s): William Jones, Harry Bruce

Available at:

Discussion Leader: Len Shapiro

Friday, 17 March 2006 @ 10:00 AM

Title: Unpublished draft

Author(s): U. C. Berkeley

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Kristin Tufte

Friday, 10 March 2006 @ 10:00 AM

Title: C-Store: A Column-oriented DBMS

VLDB '05

Author(s): Mike Stonebraker, Daniel Abadi, Adam Batkin, Xuedong Chen, Mitch Cherniack, Miguel Ferreira, Edmond Lau, Amerson Lin, Sam Madden, Elizabeth O'Neil, Pat O'Neil, Alex Rasin, Nga Tran, Stan Zdonik

Available at:

Discussion Leader: Laura Bright

Friday, 03 March 2006 @ 10:00 AM

Title: Enhancing P2P File-Sharing with an Internet-Scale Query Processor

VLDB '04

Author(s): Boon Thau Loo, Joseph M. Hellerstein, Ryan Huebsch, Scott Shenker, Ion Stoica

Available at:

Discussion Leader: Vassilis Papadimos

Thursday, 23 February 2006 @ 5:00 PM

Title: An Information Network Overlay Architecture for the NSDL

Author(s): Carl Lagoze, Dean B. Krafft, Susan Jesuroga, Tim Cornwell, Ellen J. Cramer, and Eddie Shin

Available at:

Discussion Leader: Lois Delcambre

Friday, 17 February 2006 @ 10:00 AM

Title: Information-Theoretic Tools for Mining Database Structure from Large Data Sets


Author(s): Periklis Andritsos, Renee J. Miller, and Panayiotis Tsaparas

Available at:

Discussion Leader: Nick Rayner

Friday, 10 February 2006 @ 10:00 AM

Title: Compiled Query Execution Engine using JVM

To appear in ICDE 2006

Author(s): Jun Rao, Hamid Pirahesh, C. Mohan, Guy Lohman

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Sudarshan Murthy

Friday, 03 February 2006 @ 10:00 AM

Title: Blueprints for ETL workflows

This is a longer version of a paper presented in ER 2005

Author(s): P. Vassiliadis, A. Simitsis, M. Terrovitis, S. Skiadopoulos

Available at:

Discussion Leader: James Terwilliger

Friday, 27 January 2006 @ 10:00 AM

Title: A Distributed Event Delivery Method with Load Balancing for MMORPG

Author(s): Shinya Yamamoto, Yoshihiro Murata, Keiichi Yasumoto, Minoru Ito

Available at:

Discussion Leader: Bill Howe

Friday, 27 January 2006 @ 10:00 AM

Title: Dynamic Microcell Assignment for Massively Multiplayer Online Gaming

Author(s): Bart De Vleeschauwer, Bruno Van Den Bossche, Tom Verdickt, Filip De Turck, Bart Dhoedt, Piet Demeester

Available at:

Discussion Leader: Bill Howe

Friday, 20 January 2006 @ 10:00 AM

Title: Reference reconciliation in complex information spaces

Author(s): Xin Dong, Alon Halevy, and Jayant Madhavan

Available at:

Discussion Leader: Susan Price

Friday, 13 January 2006 @ 10:00 AM

Title: Paper-choosing session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader:

Friday, 09 December 2005 @ 10:00 AM

Title: The Google File System

SOSP 2003

Author(s): Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung

Available at:

Discussion Leader: David Maier

Friday, 02 December 2005 @ 10:00 AM

Title: Analyzing Plan Diagrams of Database Query Optimizers

VLDB '05

Author(s): Naveen Reddy and Jayant Haritsa

Available at:

Discussion Leader: Len Shapiro

Friday, 18 November 2005 @ 10:00 AM

Title: A Framework for Reliable and Efficient Data Placement in Distributed Computing Systems

To appear in Journal of Parallel and Distributed Computing, (2005)

Author(s): Tevfik Kosar and Miron Livny

Available at:

Discussion Leader: Laura Bright

Friday, 04 November 2005 @ 10:00 AM

Title: Semantic File Systems


Available at:

Discussion Leader: Eric Hanson

Friday, 28 October 2005 @ 10:00 AM

Title: MDL Summarization with Holes

VLDB '05

Author(s): Shaofeng Bu, Laks V.S. Lakshmanan, and Raymond T. Ng

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 21 October 2005 @ 10:00 AM

Title: Scientific Data Management in the Coming Decade

Microsoft Tech. Report

Author(s): Jim Gray, David T. Liu, Maria A. Nieto-Santisteban, Alexander S. Szalay, Gerd Heber, and David DeWitt

Available at:

Discussion Leader: Bill Howe

Friday, 21 October 2005 @ 10:00 AM

Title: Where the Rubber Meets the Sky: Bridging the Gap between Databases and Science

IEEE Data Engineering Bulletin, December 2004

Author(s): Jim Gray and Alex Szalay

Available at:

Discussion Leader: Bill Howe

Friday, 14 October 2005 @ 10:00 AM

Title: A Three-Layered XML View Model: A Practical Approach

ER '05

Author(s): Rajugan R., Elizabeth Chang, Tharam S. Dillon, and Ling Feng

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Sun Murthy

Friday, 07 October 2005 @ 10:00 AM

Title: Declarative Data Cleaning: Language, Model, and Algorithms

VLDB '01

Author(s): Helena Galhardas, Daniela Florescu, Dennis Shasha, Eric Simon, and Cristian-Augustin Saita

Available at:

Discussion Leader: James Terwilliger

Friday, 02 September 2005 @ 10:00 AM

Title: Towards a semantics for XML markup

Author(s): Allen Renear, David Dubin, and C. M. Sperberg-McQueen

Available at:

Discussion Leader: Susan Price

Friday, 02 September 2005 @ 10:00 AM

Title: Methods for the semantic analysis of document markup

Author(s): P. S. Bayerl et al.

Available at:

Discussion Leader: Susan Price

Wednesday, 31 August 2005 @ 4:15 PM

Title: DBRG hip-hop session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Everyone

Friday, 26 August 2005 @ 10:00 AM

Title: Principled Design of the Modern Web Architecture

ICSE 2000

Author(s): Roy T. Fielding and Richard N. Taylor

Available at:

Discussion Leader: Sudarshan Murthy/Eric Hanson

Wednesday, 24 August 2005 @ 4:15 PM



Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Dave Maier

Friday, 19 August 2005 @ 10:00 AM

Title: Sampling Algorithms in a Stream Operator


Author(s): Theodore Johnson, S. Muthukrishnan, and Irina Rozenbaum

Available at:

Discussion Leader: Kristin Tufte

Wednesday, 17 August 2005 @ 4:15 PM

Title: The Object-Oriented Database System Manifesto

Author(s): M. Atkinson, F. Bancilhon, D. DeWitt, K. Dittrich, D. Maier, and S. Zdonik

Available at:

Discussion Leader: Lois Delcambre

Wednesday, 17 August 2005 @ 4:15 PM

Title: Third-generation Database System Manifesto

Author(s): The Committee for Advanced DBMS Function

Available at:

Discussion Leader: Lois Delcambre

Friday, 12 August 2005 @ 10:00 AM

Title: Stacked Indexed Views in Microsoft SQL Server


Author(s): David DeHaan, Per-Ake Larson, and Jingren Zhou

Available at:

Discussion Leader: David Maier

Wednesday, 10 August 2005 @ 4:15 PM

Title: A database perspective on the semantic web: a brief commentary

Author(s): Frank Manola

Available at:

Discussion Leader: Nick Rayner

Friday, 05 August 2005 @ 10:00 AM

Title: Automatic Performance Diagnosis and Tuning in Oracle

Author(s): Karl Dias, Mark Ramacher, Uri Shaft, Venkateshwaran Venkataramani, and Graham Wood

Available at:

Discussion Leader: Len Shapiro

Wednesday, 03 August 2005 @ 4:15 PM



Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Chris Dubay

Friday, 29 July 2005 @ 10:00 AM

Title: Update-Pattern-Aware Modeling and Processing of Continuous Queries


Author(s): Lukasz Golab and M. Tamer Ozsu

Available at:

Discussion Leader: Jin Li

Wednesday, 20 July 2005 @ 4:15 PM

Title: Database systems: achievements and opportunities

Author(s): Avi Silberschatz, Michael Stonebraker, Jeff Ullman

Available at:

Discussion Leader: Susan Price

Wednesday, 20 July 2005 @ 4:15 PM

Title: The Lowell Database Research Self-Assessment Meeting


Available at:

Discussion Leader: Susan Price

Friday, 15 July 2005 @ 10:00 AM

Title: System RX: One Part Relational, One Part XML


Author(s): K. Beyer et al.

Available at:

Discussion Leader: Laura Bright

Wednesday, 13 July 2005 @ 4:15 PM

Title: On Six Degrees of Separation in DBLP-DB and More

Author(s): E. Elmacioglu and D. Lee

Available at:

Discussion Leader: Laura Bright

Friday, 08 July 2005 @ 10:00 AM

Title: A Graphical Language for Relational Multi-Database Querying and Restructuring

ICCI '98

Author(s): Fereidoon Sadri and Patrick L. Shouse

Available at:

Discussion Leader: James Terwilliger

Wednesday, 06 July 2005 @ 4:15 PM

Title: You and your Research

Author(s): Richard Hamming (transcribed by J. F. Kaiser)

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 01 July 2005 @ 10:00 AM

Title: Relational confidence bounds are easy with the bootstrap


Author(s): Abhijit Pol and Christopher Jermaine

Available at:

Discussion Leader: Vassilis Papadimos

Wednesday, 29 June 2005 @ 4:15 PM

Title: Rooter: A Methodology for the Typical Unification of Access Points and Redundancy

Author(s): Jeremy Stribling, Daniel Aguayo and Maxwell Krohn

Available at:

Discussion Leader: James Terwilliger

Friday, 24 June 2005 @ 10:00 AM

Title: Paper-choosing session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader:

Friday, 10 June 2005 @ 10:00 AM

Title: A Heartbeat Mechanism and its Application in Gigascope

To appear in VLDB 2005

Author(s): Theodore Johnson, S. Muthukrishnan, Vladislav Shkapenyuk, and Oliver Spatscheck

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Dave Maier

Friday, 03 June 2005 @ 10:00 AM

Title: Applying Model Management to Classical Meta Data Problems

CIDR '03

Author(s): Philip Bernstein

Available at:

Discussion Leader: James Terwilliger

Friday, 27 May 2005 @ 10:00 AM

Title: A Game Theoretic Framework for Incentives in P2P Systems

P2P '03

Author(s): Chiranjeeb Buragohain, Divyakant Agrawal

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 20 May 2005 @ 10:00 AM

Title: Robust and Fast Similarity Search for Moving Object Trajectories

Author(s): Lei Chen, Tamer Ozsu, Vincent Oria

Available at:

Discussion Leader: Bill Howe

Friday, 13 May 2005 @ 10:00 AM

Title: Vision Paper: Enabling Privacy for the Paranoids

Author(s): G. Aggarwal et al.

Available at:

Discussion Leader: Laura Bright

Friday, 13 May 2005 @ 10:00 AM

Title: Privacy-preserving data integration and sharing


Author(s): Chris Clifton, Murat Kantarcioglu, AnHai Doan, Gunther Schadow, Jaideep Vaidya, Ahmed Elmagarmid, Dan Suciu

Available at:

Discussion Leader: Nick Rayner

Friday, 06 May 2005 @ 10:00 AM

Title: QPipe: A Simultaneously Pipelined Relational Query Engine


Author(s): S. Harizopoulos, V. Shkapenyuk, A. Ailamaki

Available at:

Discussion Leader: Kristin Tufte

Friday, 29 April 2005 @ 10:00 AM

Title: An Evaluation of Non-Equijoin Algorithms

VLDB 1991

Author(s): David J. DeWitt, Jeffrey F. Naughton, Donovan A. Schneider

Available at:

Discussion Leader: Jin Li

Friday, 22 April 2005 @ 10:00 AM

Title: Search Middleware and the Simple Digital Library Interoperability Protocol

Author(s): Paepcke, A., Brandriff, R., Janee, G., Larson, R., Ludaescher, B., Melnik, S., Raghavan, S.

Available at:

Discussion Leader: Sun Murthy

Friday, 22 April 2005 @ 10:00 AM

Title: Core Services in the Architecture of the National Science Digital Library

JCDL '02

Author(s): Carl Lagoze, Walter Hoehn, David Millman, William Arms, Stoney Gan, Diane Hillmann, Christopher Ingram, Dean Krafft, Richard Marisa, Jon Phipps, John Saylor, Carol Terrizzi, James Allan, Sergio Guzman-Lara, Tom Kalt

Available at:

Discussion Leader: Sun Murthy

Friday, 15 April 2005 @ 10:00 AM

Title: Implementing A Scalable XML Publish/Subscribe System Using Relational Database Systems


Author(s): Tian, Reinwald, Pirahesh, Mayr, and Myllymaki

Available at:

Discussion Leader: Susan Price

Friday, 08 April 2005 @ 10:00 AM

Title: RQL: A Declarative Query Language for RDF

Author(s): Gregory Karvounarakis, Vassilis Christophides, Sofia Alexaki, Dimitris Plexousakis, Michel Scholl

Available at:

Discussion Leader: Lois Delcambre

Friday, 01 April 2005 @ 10:00 AM

Title: Paper-choosing session


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader:

Friday, 18 March 2005 @ 10:00 AM

Title: (Almost) Hands-off Information Integration for the Life Sciences

CIDR '05

Author(s): U. Leser and F. Naumann

Available at:

Discussion Leader: Laura Bright

Friday, 11 March 2005 @ 10:00 AM

Title: Using Probabilistic Models for Data Management in Acquisitional Environments

CIDR '05

Author(s): A. Deshpande, C. Guestrin, and S. Madden

Available at:

Discussion Leader: Kristin Tufte

Friday, 04 March 2005 @ 10:00 AM

Title: Towards a theory of natural language interfaces to databases

Author(s): Ana-Maria Popescu, Oren Etzioni, and Henry Kautz

Available at:

Discussion Leader: Nick Rayner

Friday, 25 February 2005 @ 10:00 AM

Title: The WebDAV Property Design

Author(s): E. James Whitehead, Jr., and Yaron Y. Goland

Available at:

Discussion Leader: Eric Hanson

Friday, 18 February 2005 @ 10:00 AM

Title: Supporting personal collections across digital libraries in spatial hypertext

JCDL '04

Author(s): Frank M. Shipman, Haowei Hsieh, J. Michael Moore, and Anna Zacchi

Available at:

Discussion Leader: David Maier

Friday, 11 February 2005 @ 10:00 AM

Title: Lessons Learned Managing a Petabyte

CIDR '05

Author(s): Jacek Becla and Daniel L. Wang

Available at:

Discussion Leader: Bill Howe

Friday, 04 February 2005 @ 10:00 AM

Title: Web-Scale Information Extraction in KnowItAll (Preliminary Results)

Author(s): Etzioni et al.

Available at:

Discussion Leader: Susan Price

Friday, 28 January 2005 @ 10:00 AM

Title: The Design of the Borealis Stream Processing Engine

CIDR '05

Author(s): Daniel J. Abadi, Yanif Ahmad, Magdalena Balazinska, Ugur Cetintemel, Mitch Cherniack, Jeong-Hyon Hwang, Wolfgang Lindner, Anurag S. Maskey, Alexander Rasin, Esther Ryvkina, Nesime Tatbul, Ying Xing, and Stan Zdonik

Available at:

Discussion Leader: Jin Li

Friday, 21 January 2005 @ 10:00 AM

Title: Lazy Query Evaluation for Active XML


Author(s): Serge Abiteboul, Omar Benjelloun, Bogdan Cautis, Ioana Manolescu, Tova Milo, and Nicoleta Preda

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 10 December 2004 @ 10:00 AM

Title: Remembrance of Streams Past: Overload-Sensitive Management of Archived Streams

VLDB 2004

Author(s): Sirish Chandrasekaran and Michael Franklin

Available at:

Discussion Leader: Kristin Tufte

Friday, 03 December 2004 @ 10:00 AM

Title: SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases


Author(s): Mohamed F. Mokbel, Xiaopeng Xiong, and Walid G. Aref

Available at:

Discussion Leader: Jin Li

Friday, 19 November 2004 @ 10:00 AM

Title: DataMover: Robust Terabyte-Scale Multi-file Replication over Wide-Area Networks

SSDBM 2004

Author(s): A. Sim, J. Gu, A. Shoshani, V. Natarajan

Available at:

Discussion Leader: Laura Bright

Friday, 12 November 2004 @ 10:00 AM

Title: Estimating Progress of Execution for SQL Queries


Author(s): Chaudhuri, S., Narasayya, V., and Ramamurthy, R.

Available at:

Discussion Leader: Len Shapiro

Friday, 05 November 2004 @ 10:00 AM

Title: Query Languages and Data Models for Database Sequences and Data Streams

VLDB 2004

Author(s): Yan-Nei Law, Haixun Wang, and Carlo Zaniolo

Available at:

Discussion Leader: Dave Maier

Friday, 29 October 2004 @ 10:00 AM

Title: XML Packaging

Author(s): W3C XML Packaging Working Group

Available at:

Discussion Leader: Eric Hanson

Friday, 29 October 2004 @ 10:00 AM

Title: Related-Resource Discovery for XML

Author(s): Tim Bray

Available at:

Discussion Leader: Eric Hanson

Friday, 29 October 2004 @ 10:00 AM

Title: Typekit

Author(s): Eric Hanson

Available at:

Discussion Leader: Eric Hanson

Friday, 22 October 2004 @ 10:00 AM

Title: Colorful XML: One Hierarchy Isn't Enough


Author(s): H. V. Jagadish, Laks V.S. Lakshmanan, Monica Scannapieco, Divesh Srivastava, and Nuwee Wiwatwattana

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 15 October 2004 @ 10:00 AM

Title: Unifying Tables, Objects and Documents


Author(s): Erik Meijer, Wolfram Schulte, and Gavin Bierman

Available at:

Discussion Leader: James Terwilliger

Friday, 08 October 2004 @ 10:00 AM

Title: Trio: A System for Integrated Management of Data, Accuracy, and Lineage

Author(s): Jennifer Widom

Available at:

Discussion Leader: Susan Price

Friday, 20 August 2004 @ 9:30 AM

Title: Efficient dynamic mining of constrained frequent sets

TODS 28(4), 2003

Author(s): Laks V. S. Lakshmanan, Carson Kai-Sang Leung, Raymond T. Ng

Available at:

Discussion Leader: Rafael Fernandez

Friday, 13 August 2004 @ 9:30 AM

Title: The Entity-Relationship model -- Toward a unified view of data

TODS 1(1), 1976

Author(s): Peter Pin-Shan Chen

Available at:

Discussion Leader: Susan Price

Friday, 06 August 2004 @ 9:30 AM

Title: Efficient Query Reformulation in Peer Data Management Systems


Author(s): Igor Tatarinov and Alon Halevy

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 30 July 2004 @ 9:30 AM

Title: GridDB: A Data-Centric Overlay for Scientific Grids

VLDB 2004

Author(s): David Lu and Michael Franklin

Available at:

Discussion Leader: Bill Howe

Friday, 23 July 2004 @ 9:30 AM

Title: Adapting to Source Properties in Processing Data Integration Queries

Author(s): Zachary G. Ives, Alon Halevy, and Daniel S. Weld

Available at:

Discussion Leader: Sun Murthy

Friday, 16 July 2004 @ 9:30 AM

Title: Holistic UDAFs at Streaming Speeds


Author(s): Graham Cormode, Theodore Johnson, Flip Korn, S. Muthukrishnan, Oliver Spatscheck, and Divesh Srivastava

Available at:

Discussion Leader: Jin Li

Friday, 09 July 2004 @ 9:30 AM

Title: A comprehensive XQuery to SQL translation using dynamic interval encoding


Author(s): David DeHaan, David Toman, Mariano P. Consens, and M. Tamer Ozsu

Available at:

Discussion Leader: James Terwilliger

Friday, 02 July 2004 @ 9:30 AM

Title: Limiting Disclosure in Hippocratic Databases

VLDB 2004

Author(s): Kristen LeFevre, Rakesh Agrawal, Vuk Ercegovac, Raghu Ramakrishnan, Yirong Xu, and David DeWitt

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Kristin Tufte

Friday, 11 June 2004 @ 9:30 AM

Title: A Denotational Semantics for Continuous Queries over Streams and Relations

Author(s): Arvind Arasu and Jennifer Widom

Available at:

Discussion Leader: David Maier

Friday, 11 June 2004 @ 9:30 AM

Title: The CQL Continuous Query Language: Semantic Foundations and Query Execution

Author(s): Arvind Arasu, Shivnath Babu, and Jennifer Widom

Available at:

Discussion Leader: David Maier

Friday, 04 June 2004 @ 9:30 AM

Title: Measurement, Modeling, and Analysis of a Peer-to-Peer File-Sharing Workload

SOSP '03

Author(s): Krishna P. Gummadi, Richard J. Dunn, Stefan Saroiu, Steven D. Gribble, Henry M. Levy, and John Zahorjan

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 28 May 2004 @ 9:30 AM

Title: Processing Set Expressions over Continuous Update Streams


Author(s): Sumit Ganguly, Minos Garofalakis, and Rajeev Rastogi

Available at:

Discussion Leader: Kristin Tufte

Friday, 21 May 2004 @ 9:30 AM

Title: Passage Retrieval Based On Language Models


Author(s): Xiaoyong Liu and W. Bruce Croft

Available at:

Discussion Leader: Susan Price

Friday, 14 May 2004 @ 12:15 PM

Title: Optimizing Queries across Diverse Data Sources


Author(s): Laura M. Haas, Donald Kossmann, Edward L. Wimmers, and Jun Yang

Available at:

Discussion Leader: Sun Murthy

Friday, 30 April 2004 @ 9:30 AM

Title: A survey of data mining and knowledge discovery software tools

ACM SIGKDD Explorations, Vol. 1, Issue 1

Author(s): Michael Goebel and Le Gruenwald

Available at:

Discussion Leader: Rafael Fernandez

Friday, 23 April 2004 @ 9:30 AM

Title: Ad hoc Query Support for Very Large Simulation Mesh Data: the Metadata Approach.

Author(s): B.S. Lee, R.R. Snapp, R. Musick, and T. Critchlow

Available at:

Discussion Leader: Bill Howe

Friday, 16 April 2004 @ 9:30 AM

Title: Efficient Execution of Sliding-Window Queries over Data Streams

Purdue Technical Report

Author(s): Moustafa A. Hammad, Walid G. Aref, Michael J. Franklin, Mohamed F. Mokbel and Ahmed K. Elmagarmid

Available at:

Discussion Leader: Jin Li

Friday, 09 April 2004 @ 9:30 AM

Title: Tables As a Paradigm for Querying and Restructuring


Author(s): Marc Gyssens, Laks V. S. Lakshmanan, and Iyer N. Subramanian

Available at:

Discussion Leader: James Terwilliger

Friday, 19 March 2004 @ 9:30 AM

Title: Operator Scheduling in a Data Stream Manager

Author(s): Don Carney, Ugur Cetintemel, Alex Rasin, Stan Zdonik, Mitch Cherniack, Mike Stonebraker

Available at:

Discussion Leader: Kristin Tufte

Friday, 12 March 2004 @ 9:30 AM

Title: Optimizing Fixed-Schema XML to SQL Query Translation

VLDB 2002

Author(s): Rajasekar Krishnamurthy, Raghav Kaushik and Jeffrey F. Naughton

Available at:

Discussion Leader: Fang Du

Friday, 27 February 2004 @ 9:30 AM

Title: Information Integration in Schema-Based Peer-to-Peer Networks

CAISE 2003

Author(s): Alexander Loeser, Wolf Siberski, Martin Wolpers and Wolfgang Nejdl

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 20 February 2004 @ 9:30 AM

Title: nD-SQL: A multi-dimensional language for interoperability and OLAP

VLDB 1998

Author(s): F. Gingras and L. V. S. Lakshmanan

Available at:

Discussion Leader: James Terwilliger

Friday, 13 February 2004 @ 9:30 AM

Title: Optimizing Queries across Diverse Data Sources

VLDB 1997

Author(s): Laura M. Haas, Donald Kossmann, Edward L. Wimmers, and Jun Yang

Available at:

Discussion Leader: Sun Murthy

Friday, 06 February 2004 @ 9:30 AM

Title: GODIVA: Lightweight Data Management for Scientific Visualization Applications

ICDE 2004

Author(s): Xiaosong Ma, Marianne Winslett, John Norris, Xiangmin Jiao, and Robert Fiedler

Available at:

Discussion Leader: Laura Bright

Friday, 30 January 2004 @ 9:30 AM

Title: How Do People Get Back to Information on the Web? How Can They Do It Better?


Author(s): Jones, W., Bruce, H., Dumais, S.

Available at:

Discussion Leader: Dave Maier

Friday, 30 January 2004 @ 9:30 AM

Title: Once found, what then?: a study of "keeping" behaviors in personal use of Web information

ASIST 2002

Author(s): Jones, W., Dumais, S., and Bruce, H.

Available at:

Discussion Leader: Dave Maier

Friday, 30 January 2004 @ 9:30 AM

Title: A system for personal information retrieval and re-use

SIGIR 2003

Author(s): S. T. Dumais, E. Cutrell, J. J. Cadiz, G. Jancke, R. Sarin, and D. C. Robbins

Available at:

Discussion Leader: Dave Maier

Friday, 23 January 2004 @ 9:30 AM

Title: Scheduling for shared window joins over data streams

Author(s): Moustafa A. Hammad, Michael J. Franklin, Walid G. Aref, and Ahmed K. Elmagarmid

Available at:

Discussion Leader: Jin Li

Friday, 16 January 2004 @ 9:30 AM

Title: Querying Heterogeneous XML Sources through a Conceptual Schema

ER 2003

Author(s): Sandro Daniel Camillo, Carlos Alberto Heuser, and Ronaldo dos Santos Mello

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre

Friday, 12 December 2003 @ 9:30 AM

Title: Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery

Author(s): S. Chakrabarti, M. van den Berg, and B. Dom

Available at:

Discussion Leader: Vinit Kalra

Friday, 21 November 2003 @ 9:30 AM

Title: Sorting And Indexing With Partitioned B-Trees

Proceedings of CIDR 2003

Author(s): Goetz Graefe

Available at:

Discussion Leader: Dave Maier

Friday, 14 November 2003 @ 9:30 AM

Title: Locating Data Sources in Large Distributed Systems

Proceedings of VLDB 2003

Author(s): Leonidas Galanis, Yuan Wang, Shawn R. Jeffery, David J. DeWitt

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 07 November 2003 @ 9:30 AM

Title: A Data Model for Distributed Multiresolution Multisource Scientific Data

Proceedings of ISOFSEM 2002

Author(s): Philip J. Rhodes, R. Daniel Bergeron, and Ted M. Sparr

Available at:

Discussion Leader: Bill Howe

Friday, 31 October 2003 @ 9:30 AM

Title: Notions of Indistinguishability for Semantic Web Languages

You can also find a hard copy of the paper in the bin in the CSE Compton reception area

Author(s): Jaap Kamps, Maarten Marx

Available at:

Discussion Leader: Susan Price

Friday, 24 October 2003 @ 9:30 AM

Title: Extending the Role of Digital Library: Computer Support for Creating and Using the Literature

You can also find a hard copy of the paper in the bin in the CSE Compton reception area

Author(s): L. Carr, T. Miles-Board, G. Wills, G. Power, C. Bailey, W. Hall, and S. Grange

Available at:

Discussion Leader: Sun Murthy

Friday, 17 October 2003 @ 9:30 AM

Title: Aurora: A New Model and Architecture for Data Stream Management

Author(s): D. Abadi, D. Carney, U. Cetintemel, M. Cherniack, C. Convey, S. Lee, M. Stonebraker, N. Tatbul, and S. Zdonik

Available at:

Discussion Leader: Jin Li

Friday, 10 October 2003 @ 9:30 AM

Title: MetaXPath

Proceedings of 2001 Intl. Conf. on Dublin Core and Metadata Applications

Author(s): C. E. Dyreson, M. H. Bohlen, and C. S. Jensen

Available at:

Discussion Leader: Fang Du

Friday, 05 September 2003 @ 9:30 AM

Title: Query Processing of Streamed XML Data

11th International Conference on Information and Knowledge Management (CIKM'2002). McLean, VA, November 2002

Author(s): Leonidas Fegaras, David Levine, Sujoe Bose, and Vamsi Chaluvadi

Available at:

Discussion Leader: Dave Maier

Friday, 29 August 2003 @ 9:30 AM

Title: E-services: A Look Behind the Curtain

PODS 2003

Author(s): Richard Hull, Michael Benedikt, Vassilis Christophides, Jianwen Su

Available at:

Discussion Leader: Juliana Freire

Friday, 22 August 2003 @ 9:30 AM

Title: Topical Relevance Relationships. I. Why Topic Matching Fails

Journal of the American Society for Information Science 46(9): 646-653, 1995

Author(s): Rebecca Green

Available at:

Discussion Leader: Susan Price

Friday, 15 August 2003 @ 9:30 AM

Title: Querying Structured Text in an XML Database


Author(s): Shurug Al-Khalifa, Cong Yu, H. V. Jagadish

Available at:

Discussion Leader: Sun Murthy

Friday, 08 August 2003 @ 9:30 AM

Title: Window Explained, Window Expressed

Author(s): Sirish Chandrasekaran, Sailesh Krishnamurthy, Samuel Madden, Amol Deshpande, Michael J. Franklin, Joseph M. Hellerstein, Mehul Shah

Available at:

Discussion Leader: Jenny Li

Friday, 01 August 2003 @ 9:30 AM

Title: Scientific Data Repositories - Designing for a Moving Target


Author(s): Etzard Stolte, Christoph von Praun, Gustavo Alonso, Thomas Gross

Available at:

Discussion Leader: Bill Howe

Friday, 25 July 2003 @ 9:30 AM

Title: Cache-and-Query for Wide Area Sensor Databases


Author(s): Amol Deshpande, Suman Nath, Phillip B. Gibbons, Srinivasan Seshan

Available at:

Discussion Leader: Lingzhi Zhang

Friday, 11 July 2003 @ 9:30 AM

Title: Warping Indexes with Envelope Transforms for Query by Humming


Author(s): Yunyue Zhu, Dennis Shasha

Available at:

Discussion Leader: Pete Tucker

Friday, 13 June 2003 @ 9:30 AM

Title: Leveraging a Common Representation for Personalized Search and Summarization in a Medical Digital Library

Proc. of JCDL, 2003, Houston, TX

Author(s): Kathleen McKeown, Noemie Elhadad and Vassilis Hatzivassiloglou

Available at:

Discussion Leader: Susan Price

Friday, 06 June 2003 @ 9:30 AM

Title: The hypercontext framework for adaptive Hypertext

The proceedings of the thirteenth conference on hypertext and hypermedia June 11-15, 2002, College Park, Maryland, USA; Pages 11-20.

Author(s): Christopher D. Staff

Available at:

Discussion Leader: Sun Murthy

Friday, 30 May 2003 @ 9:30 AM

Title: Beyond Average: Towards Sophisticated Sensing with Queries

2nd International Workshop on Information Processing in Sensor Networks (IPSN '03)

Author(s): Joseph M. Hellerstein, Wei Hong, Samuel Madden, and Kyle Stanek

Available at:

Discussion Leader: Pete Tucker

Friday, 23 May 2003 @ 9:30 AM

Title: SQL and Management of External Data

SIGMOD Record Volume 30 Number 1 March 2001

Author(s): J. Melton, J. Michels, V. Josifovski, K. Kulkarni, P. Schwarz, K. Zeidenstein

Available at:

Discussion Leader: Dave Maier

Friday, 23 May 2003 @ 9:30 AM

Title: SQL/MED - A Status Report

SIGMOD Record Volume 31 Number 3 September 2002

Author(s): J. Melton, J. Michels, V. Josifovski, K. Kulkarni, P. Schwarz

Available at:

Discussion Leader: Dave Maier

Friday, 09 May 2003 @ 9:30 AM

Title: Comparing Sets of Semantic Relations in Ontologies

Author(s): Eduard Hovy

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre

Friday, 02 May 2003 @ 9:30 AM

Title: Incremental Validation of XML Documents

ICDT 2003, LNCS 2572, p 64-79

Author(s): Yannis Papakonstantinou and Victor Vianu

Available at:

Discussion Leader: Denilson Barbosa

Friday, 18 April 2003 @ 9:30 AM

Title: Dynamic XML Documents with Distribution and Replication


Author(s): Serge Abiteboul, Angela Bonifati, Gregory Cobena, Ioana Manolescu, and Tova Milo

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 11 April 2003 @ 9:30 AM

Title: Crawling the Hidden Web

VLDB 2001

Author(s): Sriram Raghavan and Hector Garcia-Molina

Available at:

Discussion Leader: Vinit Kalra

Friday, 21 March 2003 @ 9:30 AM

Title: SEQ: A Model for Sequence Databases.

Author(s): Praveen Seshadri, Miron Livny and Raghu Ramakrishnan.

Available at:

Discussion Leader: Pete Tucker

Friday, 07 March 2003 @ 9:30 AM

Title: Crossing the Structure Chasm

CIDR 2003

Author(s): Alon Halevy, Oren Etzioni, AnHai Doan, Zachary Ives, Jayant Madhavan, Luke McDowell, and Igor Tatarinov

Available at:

Discussion Leader: Sun Murthy

Friday, 28 February 2003 @ 9:30 AM

Title: Online Aggregation

Author(s): Joseph M. Hellerstein, Peter J. Haas, and Helen J. Wang

Available at:

Discussion Leader: Jin Li

Friday, 21 February 2003 @ 9:30 AM

Title: Validating streaming XML documents

Author(s): Luc Segoufin and Victor Vianu

Available at:

Discussion Leader: Lingzhi Zhang

Friday, 14 February 2003 @ 9:30 AM

Title: Decomposition - A Strategy for Query Processing

TODS 1(3): 223-241

Author(s): Eugene Wong and Karel Youssefi

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 07 February 2003 @ 9:30 AM

Title: Efficient Exploration of Large Scientific Databases.

VLDB 2002

Author(s): Etzard Stolte and Gustavo Alonso

Available at:

Discussion Leader: Bill Howe

Friday, 24 January 2003 @ 9:30 AM

Title: A Mapping Schema and Interface for XML Stores

Author(s): Sihem Amer-Yahia

Available at:

Discussion Leader: Fang Du

Friday, 17 January 2003 @ 9:30 AM

Title: The Yin/Yang Web: XML Syntax and RDF Semantics


Author(s): Peter Patel-Schneider and Jerome Simeon

Available at:

Discussion Leader: Susan Price

Friday, 13 December 2002 @ 9:30 AM

Title: Self-tuning Database Technology and Information Services: from Wishful Thinking to Viable Engineering

Author(s): Gerhard Weikum, Axel Moenkeberg, Christof Hasse, and Peter Zabback

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 06 December 2002 @ 9:30 AM

Title: LEO - DB2's LEarning Optimizer.

VLDB 2001

Author(s): Michael Stillger, Guy M. Lohman, Volker Markl, and Mokhtar Kandil

Available at:

Discussion Leader: Juliana Freire

Friday, 22 November 2002 @ 9:30 AM

Title: Learning to Map between Ontologies on the Semantic Web

Author(s): Doan, Madhavan, Domingos, and Halevy

Available at:

Discussion Leader: Susan Price

Friday, 15 November 2002 @ 9:30 AM

Title: Storing and Querying Ordered XML Using a Relational Database System


Author(s): Igor Tatarinov, Stratis Viglas, Kevin S. Beyer, Jayavel Shanmugasundaram, Eugene J. Shekita, and Chun Zhang

Available at:

Discussion Leader: Lingzhi Zhang

Friday, 08 November 2002 @ 9:30 AM

Title: Translating Web Data

Proceedings of VLDB 2002, Hong Kong SAR, China

Author(s): Lucian Popa, Yannis Velegrakis, Renee J. Miller, Mauricio A. Hernandez, and Ronald Fagin

Available at:

Discussion Leader: Lois Delcambre

Friday, 01 November 2002 @ 9:30 AM

Title: A Transducer-Based XML Query Processor

Author(s): Bertram Ludaescher, Pratik Mukhopadhyay, and Yannis Papakonstantinou

Available at:

Discussion Leader: Dave Maier

Friday, 25 October 2002 @ 9:30 AM

Title: Pipelining in Multi-Query Optimization

PODS 2001

Author(s): Nilesh N. Dalvi, Sumit K. Sanghai, Prasan Roy, and S. Sudarshan

Available at:

Discussion Leader: Bill Howe

Friday, 18 October 2002 @ 9:30 AM

Title: Augmenting Thesaurus Relationships: Possibilities for Retrieval

Author(s): Douglas Tudhope, Harith Alani, and Christopher Jones

Available at:

Discussion Leader: Mat Weaver

Friday, 04 October 2002 @ 10:25 AM



Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre or Shawn Bowers

Friday, 13 September 2002 @ 9:30 AM

Title: ExpansionTool: Concept Based Query Expansion and Construction

Information Retrieval 4(3/4), pp. 231-255

Author(s): Kalervo Jarvelin, Jaana Kekalainen, Timo Niemi

Available at:

Discussion Leader: Mat Weaver

Friday, 06 September 2002 @ 9:30 AM

Title: How To Query Network Traffic Data Using Data Streams

Author(s): Chuck Cranor, Theodore Johnson and Oliver Spatscheck

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Pete Tucker

Friday, 23 August 2002 @ 9:30 AM

Title: Hyperqueries: Dynamic Distributed Query Processing on the Internet

VLDB 2001

Author(s): Alfons Kemper, Christian Wiesner

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 16 August 2002 @ 9:30 AM

Title: What can databases do for Peer-to-Peer?

WebDB 2001

Author(s): Steven Gribble, Alon Halevy, Zachary Ives, Maya Rodrig, Dan Suciu

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 09 August 2002 @ 9:30 AM

Title: Data-Driven Understanding and Refinement of Schema Mappings


Author(s): Yan, Miller, Haas, Fagin

Available at:

Discussion Leader: Shawn Bowers

Friday, 26 July 2002 @ 9:30 AM

Title: Compound Descriptors in Context: A Matching Function for Classifications and Thesauri

In JCDL 2002, pp. 84--93

Author(s): Douglas Tudhope, Ceri Binding, Dorothee Blocks, Daniel Cunliffe

Available at:

Discussion Leader: Lois Delcambre

Friday, 26 July 2002 @ 9:30 AM

Title: A Methodology and System for Preserving Digital Data

In JCDL 2002, pp. 312--319

Author(s): Raymond A. Lorie

Available at:

Discussion Leader: Lois Delcambre

Friday, 19 July 2002 @ 9:30 AM

Title: Rules of Thumb in Data Engineering

ICDE 2000

Author(s): Jim Gray, Prashant J. Shenoy

Available at:

Discussion Leader: Dave Maier

Friday, 12 July 2002 @ 9:30 AM

Title: Annotea: An Open RDF Infrastructure for Shared Web Annotations

WWW10 2001 Hong Kong

Author(s): Jose Kahan, Marja-Riitta Koivunen

Available at:

Discussion Leader: Sun Murthy

Friday, 28 June 2002 @ 9:30 AM

Title: Chimera: A Virtual Data System for Representing, Querying, and Automating Data Derivation

SSDBM 2002

Author(s): Ian Foster

Available at:

Discussion Leader: Bill Howe

Friday, 14 June 2002 @ 9:30 AM

Title: Holistic Twig Joins: Optimal XML Pattern Matching

In SIGMOD 2002

Author(s): Nicolas Bruno, Divesh Srivastava, Nick Koudas

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 24 May 2002 @ 9:30 AM

Title: Mixing querying and navigation in MIX.

In Proc. of the 18th International Conference on Data Engineering, pp. 245--254, 2002.

Author(s): P. Mukhopadhyay and Y. Papakonstantinou

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Shawn Bowers

Friday, 10 May 2002 @ 9:30 AM

Title: Fine Grained Access Control for SOAP E-Services

In: Proceedings of Tenth International World Wide Web Conference (WWW10); 2001; May 1-5; Hong Kong.

Author(s): Damiani E, Vimercati S, Paraboschi S, Samarati P.

Available at:

Discussion Leader: Sun Murthy

Friday, 10 May 2002 @ 9:30 AM

Title: Latency Performance of SOAP Implementations.

To be published in: Proceedings of 2nd International Workshop on Global and Peer-to-Peer Computing on Large Scale Distributed Systems. IEEE International Symposium on Cluster Computing and the Grid; 2002; May; Berlin, Germany.

Author(s): Davis D, Parashar M.

Available at:

Discussion Leader: Sun Murthy

Friday, 03 May 2002 @ 9:30 AM

Title: Continuously Adaptive Continuous Queries over Streams

In SIGMOD 2002

Author(s): Samuel Madden, Mehul Shah, Joseph M. Hellerstein, Vijayshankar Raman

Available at:

Discussion Leader: Dave Maier

Friday, 26 April 2002 @ 9:30 AM

Title: Monitoring Streams -- A New Class of Data Management Applications

Brown CS Tech Report TR-CS-02-04

Author(s): Don Carney et al.

Available at:

Discussion Leader: Pete Tucker

Friday, 19 April 2002 @ 9:30 AM

Title: A Framework of Guidance for Building Good Digital Collections

This is a draft version of an unpublished paper -- please do not redistribute.


Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Lois Delcambre

Friday, 12 April 2002 @ 9:30 AM

Title: Scientific Workflow Management by Database Management

In SSDBM '98

Author(s): Anastassia Ailamaki, Yannis E. Ioannidis, Miron Livny

Available at:

Discussion Leader: Bill Howe

Friday, 22 March 2002 @ 9:30 AM

Title: On bounding-schemas for LDAP directories.

In Proceedings of the 7th International Conference on Extending Database Technology, Konstanz, Germany

Author(s): S. Amer-Yahia, H. Jagadish, L. Lakshmanan, D. Srivastava.

Available at:

Discussion Leader: Shawn Bowers

Friday, 08 March 2002 @ 9:30 AM

Title: A Formal Ontology of Properties

EKAW 2000

Author(s): Nicola Guarino and Christopher Welty

Available at:

Discussion Leader: Mat Weaver

Friday, 15 February 2002 @ 9:30 AM

Title: A Unified Framework for Data Translation over the Web

Both this and the next paper will be discussed in the same session!

Author(s): Riccardo Torlone and Paolo Atzeni

Available at:

Discussion Leader: Lois Delcambre

Friday, 15 February 2002 @ 9:30 AM

Title: VIKI: Spatial Hypertext Supporting Emergent Structure

Author(s): Catherine C. Marshall, Frank M. Shipman III, James H. Coombs

Available at:

Discussion Leader: Lois Delcambre

Friday, 08 February 2002 @ 9:30 AM

Title: Fjording the Stream: An Architecture for Queries over Streaming Sensor Data

To appear in ICDE 2002

Author(s): Samuel Madden, Michael J. Franklin

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Pete Tucker

Friday, 01 February 2002 @ 9:30 AM

Title: Java Support for Data-Intensive Systems: Experiences Building the Telegraph Dataflow System

SIGMOD Record 30(4), December 2001

Author(s): M. A. Shah, S. Madden, M. Franklin, and J.M. Hellerstein

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 25 January 2002 @ 9:30 AM

Title: Supporting user-defined activity spaces.

In proceedings of: Conference on Hypertext and Hypermedia 1997; Southampton; UK. Pages 112-123;

Author(s): Wang W, Haake J

Available at:

Discussion Leader: Sun Murthy

Friday, 18 January 2002 @ 9:30 AM

Title: The Roma personal Metadata service

to appear in Mobile Networks and Applications (MONET) 2002

Author(s): Edward Swierk, Emre Kiciman, Nathan C. Williams, Takashi Fukushima, Hideki Yoshida, Vince Laviano and Mary Baker

Available at:

Discussion Leader: Bill Howe

Friday, 11 January 2002 @ 9:30 AM

Title: Archiving Scientific Data

Author(s): Buneman, Khanna, Tajima, Tan

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Dave Maier

Friday, 07 December 2001 @ 9:30 AM

Title: Hypertext Interaction Revisited

Author(s): G. Golovchinsky and C. Marshall

Available at:

Discussion Leader: Sun Murthy

Friday, 30 November 2001 @ 9:30 AM

Title: Surfing Wavelets in Streams: One-Pass Summaries for Approximate Aggregate Queries

VLDB 2001

Author(s): A. Gilbert et al.

Available at:

Discussion Leader: Pete Tucker

Friday, 16 November 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Bill Howe

Friday, 16 November 2001 @ 9:30 AM

Title: Querying the Physical World

Author(s): Bonnet, Gehrke, Seshadri

Available at:

Discussion Leader: Bill Howe

Friday, 02 November 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 26 October 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Dave Maier

Friday, 19 October 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Lois Delcambre

Friday, 12 October 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Mat Weaver

Friday, 05 October 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Shawn Bowers

Friday, 31 August 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Dave

Friday, 24 August 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Mat

Friday, 17 August 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Sun

Friday, 10 August 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Lois

Friday, 03 August 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Everyone

Friday, 27 July 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Pete Tucker

Friday, 20 July 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: David Maier

Friday, 13 July 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Vassilis Papadimos

Friday, 01 June 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Pete Tucker

Friday, 25 May 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Mathew Weaver

Friday, 11 May 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Foula Vagena

Friday, 04 May 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Dave Maier

Friday, 27 April 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Shawn Bowers

Friday, 13 April 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Vasileios Papadimos

Friday, 02 March 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Foula Vagena

Friday, 23 February 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Lois Delcambre

Friday, 16 February 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Pete Tucker

Friday, 09 February 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Shawn Bowers

Friday, 02 February 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Mathew Weaver

Friday, 26 January 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: David Maier

Friday, 12 January 2001 @ 9:30 AM

Title: A Search Engine for Natural Language Applications

WWW 2005

Author(s): Michael Cafarella and Oren Etzioni

Available at:

Discussion Leader: Kevin Beck

Wednesday, 31 December 1969 @ 4:00 PM

Title: Privacy-preserving record linkage using Bloom filters

BMC Medical Informatics and Decision Making 2009, 9:41

Author(s): Rainer Schnell, Tobias Bachteler and Jörg Reiher

Available at:

Discussion Leader: Abdussalam Alawini

Wednesday, 31 December 1969 @ 4:00 PM

Title: Photon: Fault-tolerant and Scalable Joining of Continuous Data Streams


Author(s): Rajagopal Ananthanarayanan, Venkatesh Basker, Sumit Das, Ashish Gupta, Haifeng Jiang, Tianhao Qiu, Alexey Reznichenko, Deomid Ryabkov, Manpreet Singh, Shivakumar Venkataraman

Available at:

Discussion Leader: Chris

Wednesday, 31 December 1969 @ 4:00 PM

Title: Coordination Avoidance in Database Systems

VLDB 2015 - Proceedings of the VLDB Endowment, Vol. 8, No. 3

Author(s): Peter Bailis, Alan Fekete, Michael J. Franklin, Ali Ghodsi, Joseph M. Hellerstein, Ion Stoica

Available at:

Discussion Leader: Jeremiah Peschka

Wednesday, 31 December 1969 @ 4:00 PM

Title: Customized Random Walk for Generating Wikipedia Article Recommendations

Not published

Author(s): Jocelyn Hickcox and Chris Min

Available at:

Discussion Leader: Hisham

Wednesday, 31 December 1969 @ 4:00 PM

Title: Rough sets and intelligent data analysis

Information Sciences 147 (2002) 1-12

Author(s): Zdzisław Pawlak

Available at:

Discussion Leader: Basem

Wednesday, 31 December 1969 @ 4:00 PM

Title: Seven Databases in Seven Weeks

CMU Seminar series

Author(s): CMU Seminar series

Available at:

Discussion Leader: Dave

Wednesday, 31 December 1969 @ 4:00 PM

Title: Plenario: An Open Data Discovery and Exploration Platform for Urban Science

IEEE Data Engineering Bulletin, Dec 2014

Author(s): C. Catlett et al.

Available at:

Discussion Leader: Dave

Wednesday, 31 December 1969 @ 4:00 PM

Title: The Snowflake Elastic Data Warehouse

SIGMOD '16 Proceedings of the 2016 International Conference on Management of Data Pages 215-226 ACM

Author(s): Benoit Dageville, Thierry Cruanes, Marcin Zukowski, Vadim Antonov, Artin Avanes, Jon Bock, Jonathan Claybaugh, Daniel Engovatov, Martin Hentschel, Jiansheng Huang, Allison W. Lee, Ashish Motivala, Abdul Q. Munir, Steven Pelley, Peter Povinec, Greg Rahn, S

Available at: Not available online. Pick up a hardcopy or contact discussion leader.

Discussion Leader: Shree

Wednesday, 31 December 1969 @ 4:00 PM

Title: Context-Based Event Processing Systems

SpringerLink: Studies in Computational Intelligence, vol 347. Springer, Berlin, Heidelberg

Author(s): Opher Etzion, Yonit Magid, Ella Rabinovich, Inna Skarbovsky, and Nir Zolotorevsky

Available at:

Discussion Leader: Hong Quach