DataGEMS in EDBT/ICDT 2025 Joint Conference Anna Mitsopoulou from ATHENA RC presented “Analysis of Text-to-SQL Benchmarks: Limitations, Challenges and Opportunities” at EDBT/ICDT 2025 Conference. A comprehensive methodology is introduced for analysing text-to-SQL datasets and identifying their limitations. The proposed evaluation approach, which combines dataset analysis with automatic error analysis, provides deeper insights into models’ strengths and weaknesses. The EDBT/ICDT 2025 was held in Barcelona from 25th to 28th March, 2025 More info: https://edbticdt2025.upc.edu/?contents=main.html
DataGEMS in EDBT/ICDT 2025 PhD Workshop
DataGEMS in EDBT/ICDT 2025 PhD Workshop Christos Tsapelas from ATHENA RC participated at the EDBT/ICDT 2025 PhD Workshop and presented his PhD proposal “Towards a Neural Database Execution Engine”. He is developing a neural query execution engine that combines the speed of databases with the semantic reasoning of language models-enabling efficient, scalable queries across structured and unstructured data. This hybrid approach aims to redefine how we query multi-modal data systems, paving the way to advanced semantic analytics. The EDBT/ICDT 2025 was held in Barcelona from 25th to 28th March, 2025 More info: https://edbticdt2025.upc.edu/?contents=PhDWorkshop.html
DataGEMS at OL2A 2025!
DataGEMS at OL2A 2025! On April 29, during the International Conference on Optimization, Learning Algorithms and Applications (OL2A), the DataGEMS project was presented to the academic community by our Portuguese partners from the Polytechnic Institute of Bragança (IPB). The presentation showcased DataGEMS’ innovative ecosystem for dataset discovery and management, powered by natural language, AI and graph-based models. Want to stay updated? Scan the QR code to subscribe to the latest news and updates from the DataGEMS project! More info: https://ol2a.ipb.pt/ui/#/home
Analysis of Text-to-SQL Benchmarks: Limitations, Challenges
Towards a Neural Database Execution Engine Christos Tsapelas1,2 Supervised by Georgia Koutrika21Department of Informatics and Telecommunications, National and Kapodistrian University of Athens2Archimedes, Athena Research Center, Greece See more: https://datagems.eu/wp-content/uploads/2025/05/PhD-Workshop-3.pdf
Towards a Neural Database Execution Engine
Analysis of Text-to-SQL Benchmarks: Limitations, Challenges and Opportunities Anna MitsopoulouAthena Research Center, Greeceanna.mitsopoulou@athenarc.gr Georgia KoutrikaAthena Research Center, Greecegeorgia@athenarc.gr ABSTRACT Despite being a fast-paced research field, text-to-SQL systems face critical challenges. The datasets used for the training and evaluation of these systems play a vital role in determining their performance as well as the progress in the field. In this work, we introduce a methodology for text-to-SQL dataset analysis, and we perform an in-depth analysis of several text-to-SQL datasets, providing valuable insights into their capabilities and limitations and how they affect training and evaluation of text-to-SQL systems. We investigate existing evaluation methods, and propose an informative system evaluation based on error analysis. We show how our dataset analysis can help explain the behavior of a system on different datasets. Using our error analysis, we further show how we can pinpoint the sources of errors of a text- to-SQL system for a particular dataset and reveal opportunities for system improvements. 1 INTRODUCTION Text-to-SQL systems translate natural language (NL) questions to SQL relieving users from the use of SQL for accessing data in relational databases. In recent years, text-to-SQL systems [37 , 44, 51 , 58 ] have achieved significant advancements due to the use of large language models (BERT [ 9 ], T5 [ 49 ], GPT [ 48]) and the creation of task-specific datasets (e.g., WikiSQL [ 75 ], Spider [ 68 ]) used for system training and evaluation. These approaches tackle the text-to-SQL problem as a language translation prob-lem, and they train a neural network on a large amount of {NL question/SQL query} pairs [27].Unfortunately, unlike systems that translate from one natural language to another, or from natural language to code (e.g.,Python), text-to-SQL systems face challenges and do not enjoy asbroad adoption, despite the high competition that exists among them. The datasets used for the training and evaluation of text-to SQL systems play a vital role in the performance of these systems as well as in determining the progress in the field. While a system trained on a benchmark like Spider [68] may exhibit good performance on this benchmark, when it is used on a different benchmark or used in a real application/domain,it does not fare as well. Several factors, such as the type of SQL queries, their distribution, the domains, and even the size of the data, matter when training a system. A system cannot perform well for unseen (or even not “seen enough”) queries or data. On the other hand, when evaluating a text-to-SQL system, a text to-SQL benchmark may create false expectations on the query translation capabilities of the system. For example, a system achieving 80% accuracy on a dataset with simple queries could be worse than one achieving 60% accuracy on a dataset with more complex queries. An absolute accuracy number does not© 2025 Copyright held by the owner/author(s). Published in Proceedings of the 28th International Conference on Extending Database Technology (EDBT), 25th March-28th March, 2025, ISBN 978-3-98318-097-4 on OpenProceedings.org. Distribution of this paper is permitted under the terms of the Creative Commons license CC-by-nc-nd 4.0. provide much insight unless we consider the characteristics of the evaluation dataset, such as the types and distributions of NL and SQL queries. It is also important to be able to pinpoint the sources of the errors a text-to-SQL system makes, and hencereveal opportunities for system improvements. While WikiSQL [75 ] and Spider [ 68 ] are the first large-scale, multi-domain benchmarks for training and evaluating neural text-to-SQL systems, several datasets have preceded and followed them (e.g., [ 6, 15 , 32 , 38 , 70]) that serve different purposes and focus on different aspects of the text-to-SQL problem. In contrast to the effort to understand text-to-SQL systems through studies and surveys [1, 2, 7, 18, 24 , 28 , 39 , 47], extensive studies and evaluations of text-to-SQL datasets are missing. However, as we explained above, understanding the capabilities and limitations of text-to-SQL datasets is vital for making progress in the field. In this work, we present a structured survey of text-to-SQL datasets, their design objectives as well as their shortcomings. Moreover, we present a text-to-SQL dataset analysis methodology that provides a set of dimensions and measures to analyze and characterize the richness and distributions of the SQL queries, the natural language questions and the databases covered by a dataset. Using our methodology, we evaluate several datasets, and provide valuable insights into their value, complexity, and limittions. This analysis also provides insights into the limitations of current text-to-SQL systems, and reveals several opportunities for research for the development of more effective benchmarks. Furthermore, we investigate the methods and metrics for evaluating text-to-SQL systems and we point out their shortcomings. We propose an alternative in the direction of a more informative evaluation that combines a new metric and error analysis based on an automatically generated SQL query categorization that canprovide insights about the system capabilities and pain points. We show the potential of our approach for providing a more fine-grained system evaluation. In particular, we experimentally show how our dataset analysis can help explain the behavior of a system on different datasets. Using our proposed error analysis, we further show how we can pinpoint the sources of errors of a text-to-SQL system for a particular dataset.In a nutshell, our contributions are summarized as follows: We present a structured study of text-to-SQL datasets. We present a methodology for evaluating text-to-SQL datasets. We present an in-depth analysis of several text-to-SQL datasets. We provide an error analysis method combining a new metricand an automatically generated SQL query categorization. We show the potential of our dataset analysis methodologyand error analysis for more fine-grained and insightful systemevaluations. 2 TEXT-TO-SQL SYSTEMS Research on text-to-SQL systems dates back to seventies, however, recently, the use of deep learning techniques has given great boost in the development of such systems [ 31 ]. The most successful text-to-SQL systems [ 36 , 37 , 44 , 51 ] rely on pretrained language models (e.g., T5 or GPT-based architectures) and
Celebrating a Decade of the International Day of Women and Girls in Science
Celebrating a Decade of the International Day of Women and Girls in Science February 11 marks the International Day of Women and Girls in Science, a crucial occasion dedicated to recognizing and promoting gender equality in Science, Technology, Engineering, and Mathematics (STEM). Established by the United Nations in 2015, this day serves as a global call to action to break down barriers, challenge stereotypes, and foster an environment where women and girls can thrive in scientific fields. As we celebrate the 10th anniversary of this important day in 2025, it is essential to acknowledge the progress made while also recognizing the ongoing challenges that women and girls face in STEM disciplines. Systemic barriers, gender biases, and limited access to resources still hinder many from pursuing and advancing in scientific careers. Despite these challenges, women around the world continue to make groundbreaking contributions that shape the future of innovation and discovery. At DataGEMS, we take immense pride in the talented women who play an integral role in our success. From scientists and researchers to managers and leaders, their expertise, dedication, and innovative spirit drive our mission forward. Their contributions not only enrich our projects but also serve as an inspiration for the next generation of women in science. Creating a more inclusive and diverse scientific community requires collective effort. It involves promoting equitable opportunities, providing mentorship and support, and ensuring that women and girls receive the recognition they deserve for their achievements in STEM. As we commemorate this milestone, let us reaffirm our commitment to fostering a future where every woman and girl in science has the tools, support, and encouragement to reach their full potential. Together, we can build a world where talent and passion—not gender—define success in scientific and technological fields.
EOSC: DataGEMS Kickoff Featured in February Newsletter
EOSC: DataGEMS Kickoff Featured in February Newsletter The European Open Science Cloud (EOSC) community is buzzing with activity, and the DataGEMS project has proudly secured a spotlight in the EOSC Association’s February newsletter! This exciting development highlights the project’s significance within the broader EOSC landscape. What’s Inside the February Newsletter? The newsletter provides a comprehensive overview of the latest developments within the EOSC ecosystem, including: EOSC United Awarded: Significant progress in the EOSC initiative is highlighted, with the EOSC United project receiving notable recognition. EOSC Winter School 2025: An announcement regarding the upcoming EOSC Winter School in 2025, offering valuable educational opportunities for those involved in open science. EOSC Macro-Roadmap: An update on the strategic direction of EOSC, outlining the macro-roadmap for its future development. Update on the EOSC Federation Handbook: Progress on the essential EOSC Federation Handbook, providing guidelines and frameworks for collaboration. Update from Horizon Europe Project: Information regarding ongoing projects within the Horizon Europe program that are contributing to the EOSC initiative. Multiple Interviews: Insights and perspectives from key figures within the EOSC community, offering valuable knowledge and expertise. DataGEMS’ Role in EOSC The inclusion of the DataGEMS kickoff in this newsletter underscores the project’s relevance to the EOSC’s overarching goals. DataGEMS is contributing to developing and enhancing the EOSC ecosystem, and its participation is a valuable addition to the community. Staying Informed To stay up-to-date on the latest news and developments within the EOSC community, we encourage you to: Read the full EOSC Association’s February newsletter: https://mailchi.mp/eosc/eosc-association-newsletter-february-2025
DataGEMS: Successful Kick-off Meeting in Athens
DataGEMS: Successful Kick-off Meeting in Athens The European Open Science Cloud (EOSC) has welcomed the project, DataGEMS: Data Discovery Platform with Generalized Exploratory, Management, and Search Capabilities, which officially commenced on January 1st, 2025. This ambitious initiative, funded under the Horizon Europe program, aims to transform how researchers and professionals discover and manage diverse datasets. The project’s launch was marked by a highly successful kick-off meeting held in Athens on January 9th and 10th, 2025. The event drew over 50 participants, including representatives from all partner organizations and the Advisory Board committee, who contributed actively both in person and remotely. The meeting was inaugurated with opening greetings from key figures in the EOSC community: Enrico Pellizari, Project Manager from the EU side, provided valuable insights into the project’s strategic importance within the broader EOSC framework. Ute Gunsenheimer, General Secretary of the EOSC Association, highlighted the project’s alignment with the EOSC’s overarching vision. Prof. Ioannis Emiris, President of the Athena Research Center, welcomed the consortium and emphasized the project’s potential impact on the scientific community. DataGEMS: A Next-Generation Data Ecosystem The EU-funded DataGEMS project addresses the critical challenge of discovering, combining, and exploring heterogeneous datasets. By developing a next-generation dataset discovery and management ecosystem, DataGEMS will provide innovative algorithms to unlock the potential of diverse data sources. Key Features and Technologies DataGEMS will leverage state-of-the-art technologies in: Data Management: To efficiently organize and access large volumes of data. Natural Language Processing (NLP): To understand and analyze textual data. Machine Learning (ML): To identify patterns and relationships within datasets. Exploring Diverse Data Modalities The project will focus on datasets with varying data modalities, including: Tabular data Text documents Knowledge graphs Images Promoting FAIR Data Principles DataGEMS is committed to promoting FAIR (Findable, Accessible, Interoperable, Reusable) data principles in crucial domains such as education, meteorology, and linguistics. Transforming Data Exploration The project’s primary objective is to empower users with intuitive tools that enable them to gain new insights from complex, heterogeneous datasets. By facilitating seamless data discovery and analysis, DataGEMS will unlock the potential of data-driven research and innovation. Looking Ahead The successful kick-off meeting in Athens has laid a strong foundation for the DataGEMS project. With its ambitious goals and cutting-edge technologies, DataGEMS is poised to make a significant contribution to the EOSC ecosystem and the broader scientific community. Stay tuned for further updates as this exciting project progresses.