Publications and Patents
Publications
Y. Gong, Chuan Lei, X. Qin, K. Vaidya, M. Narayanaswamy, T. Kraska: SQLens: An End-to-End Framework for Error Detection and Correction in Text-to-SQL, NeurIPS 2025
PDFC. Koutras, J. Zhang, X. Qin, Chuan Lei, V. Ioannidis, C. Faloutsos, G. Karypis, A. Katsifodimos: OmniMatch: Joinability Discovery in Data Products, PVLDB 18(11): 4588-4601 (2025)
PDFJ. Liang, Chuan Lei, X. Qin, J. Zhang, A. Katsifodimos, C. Faloutsos, H. Rangwala: FEATPILOT: Automatic Feature Augmentation on Tabular Data, ICDE 2025: 2148-2160
PDFSLIDESX. Hu, Chuan Lei, X. Qin, A. Katsifodimos, C. Faloutsos, H. Rangwala: PolyJoin: Semantic Multi-key Joinable Table Search in Data Lakes, NAACL (Findings) 2025: 384-395
PDFX. Hu, X. Qin, Chuan Lei, A. Katsifodimos, Z. Shen, B. Srinivasan, H. Rangwala: DiscoverGPT: Multi-task Fine-tuning Large Language Model for Related Table Discovery, NAACL (Findings) 2025: 358-373
PDFM. Wang, Q. Gan, D. Wipf, et al.: 4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on RDBs, NeurIPS 2024 Datasets and Benchmarks Track
PDFY. Lou, Chuan Lei, X. Qin, Z. Wang, C. Faloutsos, R. Anubhai, H. Rangwala: DataLore: can LLM find all lost scrolls in a data repository?, ICDE 2024 Industrial and Applications Track
PDFSLIDESK. Kong, J. Zhang, Z. Shen, B. Srinivasan, Chuan Lei, C. Faloutsos, H. Rangwala, G. Karypis: OpenTab: Advancing Large Language Models as Open-domain Table Reasoners, ICLR 2024
PDFV. Meduri, A. Quarmar, Chuan Lei, X. Qin, B. Reinwald: ALFA: Active Learning for Graph Neural Network-Based Semantic Schema Alignment, The VLDB Journal (2023)
PDFX. Hu, S. Wang, X. Qin, Chuan Lei, Z. Shen, C. Faloutsos, A. Katsifodimos, G. Karypis, L. Wen and P. S. Yu: Automatic Table Union Search with Tabular Representation Learning, ACL (Findings) 2023: 3786–3800
PDFX. Qin, N. Sheikh, Chuan Lei, B. Reinwald, G. Domeniconi: SEIGN: A Simple and Efficient Graph Neural Network for Large Dynamic Graphs, ICDE 2023: 2841-2854
PDFSLIDESPOSTERChuan Lei, A. Quamar, V. Efthymiou, F. Özcan, R. Alotaibi: HERMES: Data Placement and Schema Optimization for Enterprise Knowledge Bases, The VLDB Journal 32(3), 549–574 (2023)
PDFA. Quamar, V. Efthymiou, Chuan Lei, F. Özcan: Natural Language Interfaces to Data, Foundations and Trends® in Databases: Vol. 11: No. 4, 319-414
PDFN. Sheikh, X. Qin, B. Reinwald, Chuan Lei: Scaling Knowledge Graph Embedding Models for Link Prediction, EuroMLSys@EuroSys 2022: 87-94
PDFL. Ma, Chuan Lei, O. Poppe, E. Rundensteiner: Gloria: Graph-based Sharing Optimizer for Event Trend Aggregation, SIGMOD 2022: 1122-1135
PDFSLIDESJ. Hao, Chuan Lei, V. Efthymiou, A. Quamar, F. Özcan, Y. Sun, W. Wang: MEDTO: Medical Data to Ontology Matching Using Hybrid Graph Neural Networks, SIGKDD 2021: 2946-2954
PDFSLIDESA. Vretinaris, Chuan Lei, V. Efthymiou, X. Qin, F. Özcan: Medical Entity Disambiguation Using Graph Neural Networks, SIGMOD 2021: 2310-2318
PDFSLIDESO. Poppe, Chuan Lei, L. Ma, A. Rozet, E. Rundensteiner: To Share, or not to Share Online Event Trend Aggregation Over Bursty Event Streams, SIGMOD 2021: 1452-1464
PDFSLIDESF. Özcan, Chuan Lei, A. Quamar, V. Efthymiou: Semantic Enrichment of Data for AI Applications, Invited Paper, 5th DEEM@SIGMOD 2021
PDFSLIDESS. Ahmetaj, V. Efthymiou, R. Fagin, P. G. Kolaitis, Chuan Lei, F. Özcan, L. Popa: Ontology-Enriched Query Answering on Relational Databases, IAAI 2021: 15247-15254
PDFSLIDESCODER. Alotaibi, Chuan Lei, A. Quamar, V. Efthymiou, F. Özcan: Property Graph Schema Optimization for Domain-Specific Knowledge Graphs, ICDE 2021: 924-935
PDFSLIDESJ. Sen, Chuan Lei, A. Quamar, F. Özcan, V. Efthymiou, A. Dalmia, G. Stager, A. Mittal, D. Saha, K. Sankaranarayanan: ATHENA++: Natural Language Querying for Complex Nested SQL Queries, PVLDB 13(11): 2747-2759 (2020)
PDFSLIDESDATAX. Han, L. Hu, J. Sen, Y. Dang, B. Gao, V. Isahagian, Chuan Lei, V. Efthymiou, F. Özcan, A. Quamar, Z. Huang, V. Muthusamy: Bootstrapping Natural Language Querying on Process Automation Data, IEEE SCC 2020: 170-177
PDFA. Rozet, O. Poppe, Chuan Lei, E. Rundensteiner: Muse: Multi-query Event Trend Aggregation, CIKM 2020: 2193-2196 (Best Short Paper Award)
PDFSLIDESF. Özcan, A. Quamar, J. Sen, Chuan Lei, V. Efthymiou: State of the Art and Open Challenges in Natural Language Interfaces to Data, SIGMOD 2020 (Tutorial): 2629-2636
PDFSLIDESA. Quamar, Chuan Lei, D. Miller, F. Özcan, J. Kreulen, R. J. Moore, V. Efthymiou: An Ontology-Based Conversation System for Knowledge Bases. SIGMOD 2020: 361-376
PDFSLIDES
Authors are listed in alphabetical order by authors' first names.Chuan Lei, V. Efthymiou, R. Geis and F. Özcan: Expanding Query Answers on Medical Knowledge Bases. EDBT 2020: 567-578
PDFSLIDESO. Poppe, Chuan Lei, E. A. Rundensteiner, and D. Maier: Event Trend Aggregation Under Rich Event Matching Semantics. SIGMOD 2019: 555-572
PDFSLIDESJ. Sen, F. Özcan, A. Quamar, G. Stager, A. Mittal, M. Jammi, Chuan Lei, D. Saha, K. Sankaranarayanan: Natural Language Querying of Complex Business Intelligence Queries. SIGMOD 2019: 1997-2000
PDF,POSTERChuan Lei, F. Özcan, A. Quamar, et al.: Ontology-Based Natural Language Query Interfaces for Data Exploration. Invited Paper, IEEE Data Eng. Bull. 41(3): 52-63 (2018)
PDFO. Poppe, A. Rozet, Chuan Lei, E. A. Rundensteiner, and D. Maier: Sharon: Shared Online Event Sequence Aggregation. ICDE 2018: 737-748
PDFSLIDESO. Poppe, Chuan Lei, E. A. Rundensteiner, and D. Maier: GRETA: Graph-based Real-time Event Trend Aggregation. PVLDB 11(1): 80-92 (2018)
PDFSLIDESO. Poppe, Chuan Lei, S. Ahmed, and E. A. Rundensteiner: Complete Event Trend Detection in High-Rate Event Streams. SIGMOD 2017: 109-124
PDFSLIDESO. Poppe, Chuan Lei, E. A. Rundensteiner, et al.: CAESAR: Context-Aware Event Stream Analytics for Urban Transportation Services. EDBT 2017: 590-593
PDFZ. Zhuang, Chuan Lei, E. A. Rundensteiner, and M. Y. Eltabakh: PRO: Preference-Aware Recurring Query Optimization. CIKM 2016: 2191-2196
PDFPOSTERM. Ray, Chuan Lei, and E. A. Rundensteiner: Scalable Pattern Sharing over Event Streams. SIGMOD 2016: 495-510
PDFSLIDESM. Ray, Chuan Lei, and E. A. Rundensteiner: SPASS: Scalable Event Stream Processing Leveraging Sharing Opportunities. Invited Poster, DEBS 2016: 336-339
PDFPOSTERO. Poppe, Chuan Lei, E. A. Rundensteiner, and D. Dougherty: Context-aware Event Stream Analytics. EDBT 2016: 425-436
PDFSLIDESE. A. Rundensteiner, O. Poppe, Chuan Lei, et al.: Exploiting Sharing Opportunities for Real-time Complex Event Analytics. Invited Paper, IEEE Data Eng. Bull. 38(4): 82-93 (2015)
PDFChuan Lei, Z. Zhuang, E. A. Rundensteiner, and M. Y. Eltabakh: Shared Execution of Recurring Workloads in MapReduce. PVLDB 8(7): 714-725 (2015)
PDFChuan Lei, E. A. Rundensteiner, and M. Y. Eltabakh: Redoop: Supporting Recurring Queries in Hadoop. EDBT 2014: 25-36
PDFChuan Lei, Z. Zhuang, E. A. Rundensteiner, and M. Y. Eltabakh: Redoop Infrastructure for Recurring Big Data Queries. PVLDB 7(13): 1589-1592 (2014)
PDFChuan Lei, E. A. Rundensteiner: Robust Distributed Query Processing for Streaming Data. ACM Trans. Database Syst. 39(2):17 (2014)
PDFChuan Lei, E. A. Rundensteiner, and J. D. Guttman: Robust Distributed Stream Processing. ICDE 2013: 817-828
PDFR. Nehme, K. Works, Chuan Lei, E. A. Rundensteiner, and E. Bertino: Multi-route query processing and optimization. J. Comput. Syst. Sci. 79(3):312-329 (2013)
PDF
Patents
Synthetic Data Generation and Validation for Training and Evaluating Text-to-SQL Agents
Patent US App. 63/913,328, 2025Detecting Semantic Errors In Generated Query Language Instructions According To Heterogeneous Error Signals
Patent US App. 19/208,421, 2025AutoDoc: Concise and Consistent/Dependable Table Description Generation
Patent US App. 18/519,373, 2023Automatic Related Table Discovery in Data Lakes
Patent US App. 18/478,732, 2023Item Attribute Determination Using a Co-Engagement Graph
Patent US App. 17/935,916, 2022Directly Identifying Items From an Item Catalog Satisfying a Received Query Using a Model Determining Measures of Similarity Between Items in the Item Catalog and the Query
Patent US App. 17/542,491, 2021Hybrid Graph Neural Network
US Patent 12,333,425, 2025Guided Exploration for Conversational Business Intelligence
US Patent 12,393,615, 2025Ontology-based Data Storage for Distributed Knowledge Bases
US Patent 12,387,112, 2025Entity Disambiguation Using Graph Neural Networks
US Patent 12,159,224, 2024Query Relaxation Using External Domain Knowledge for Query Answering
US Patent 11,841,867, 2023Reducing Response Time for Queries Directed to Domain-Specific Knowledge Graph using Property Graph Schema Optimization
US Patent 11,157,467, 2021
