-------- Forwarded Message -------- Subject: [AISWorld] Call for Papers: JDIQ - Special Issue on Deep Learning for Data Quality Date: Tue, 27 Oct 2020 08:55:42 -0700 From: Donatello Santoro donatello.santoro@unibas.it To: aisworld@lists.aisnet.org
___________________________________________________________
CALL FOR PAPERS
ACM Journal of Data and Information Quality Special Issue on Deep Learning for Data Quality
___________________________________________________________
* Guest Editors: -Paolo Papotti, EURECOM (France) -Donatello Santoro, Università degli Studi della Basilicata (Italy) -Saravanan Thirumuruganathan, QCRI (Qatar)
* Context: Deep learning (DL) has been recently used successfully for monitoring and improving data quality (DQ). Examples include data integration tasks such as entity resolution and schema matching, data cleaning tasks such as error detection and repair, and data curation in general. The data curation community has successfully leveraged deep learning techniques spanning from word embeddings to transformers to achieve state-of-the-art performance on well established data quality benchmarks. Nevertheless, there is still an open debate on which technical solution performs best for relational data and under which setting.
Despite a promising start, deep learning for data quality has a long way to go in achieving the human level performance that it has achieved in domains such as computer vision, natural language processing, and speech recognition. While there have been some substantial improvements in specific tasks such as entity resolution and data repair/imputation, many of the other data quality tasks (such as data discovery, data profiling, data integration, record fusion) are yet to fully benefit from the DL revolution. Also, it is not clear how to push DL techniques to get the same level of adaptation achieved by more traditional logic-based methods. For example, interpretability of the models is a key stumbling block. How can one develop DQ explanations that are consumed by non-experts? Should the explanation be generated individually for each error? Or can it be summarized so that the user gets a high level overview? Finally, DL data quality tools need novel explanation algorithms which are not a priority for DL researchers as the architecture is quite specific.
This special issue focuses on deep learning used for assessing and improving the quality of data. Thus, the issue is addressed to those members from the data science community proposing novel methods, architectures and algorithms capable of integrating, cleaning and profiling relational data sources with supervised and unsupervised approaches.
* Topics: The goal of this special issue is to collect recent advances, innovations, and practices in ML, data and software engineering for building techniques, solutions, and systems that support users in assessing and improving relational data quality. The topics of interest are inspired from the themes above and include, but are not limited to: - Deep learning methods for data integration and data cleaning - Deep learning methods for metadata discovering/profiling, including constraint discovery - Making deep learning methods for data quality interpretable - Experimental studies of deep learning methods for data quality - Deep learning methods for curating data in domain specific applications - Scalability of deep learning methods for data quality (speeding up DL for DQ using GPU) - Characterization of data quality tasks that are more amenable to deep learning - Reducing the need of large amount of training data in supervised approaches (weak- and self-supervision for data quality) - Combination of logic based and DL based methods for data quality
* Expected contributions: We welcome three types of research contributions: - Full research papers describing a novel contribution to the field (up to 25 pages) - Experience papers discussing important lessons learned (up to 20 pages) - Vision and Challenge papers (up to 7 pages) - Survey papers (up to 30 pages)
* Submission Format: JDIQ welcomes manuscripts that extend prior published work, provided they contain at least 30% new material, and that the significant new contributions are clearly identified in the introduction.
Submission guidelines with Latex (preferred) or Word templates are available at: http://jdiq.acm.org/authors.cfm#subm
To submit, select the paper type "SI: Deep Learning for Data Quality"
* Important Dates: - Submission deadline: March 1, 2021 - First notification: May 15, 2021 - Revised manuscripts deadline: July 15, 2021 - Final notification: September 15, 2021 - Camera-ready manuscripts: October 15, 2021 - Estimated publication date: January 2022 _______________________________________________ AISWorld mailing list AISWorld@lists.aisnet.org