-------- Original-Nachricht -------- Betreff: Feature Selection for Data Mining 2006 - Call for Papers (FSDM 2006) Datum: Wed, 28 Dec 2005 11:34:23 -0700 (MST) Von: Huan Liu Huan.Liu@asu.edu Firma: "OptimaNumerics" An: Computational Science Mailing List computational.science@lists.optimanumerics.com
International Workshop on Feature Selection for Data Mining - Interfacing Machine Learning and Statistics
in conjunction with 2006 SIAM Data Mining Conference, April 22, 2006 Bethesda, Maryland
http://enpub.eas.asu.edu/workshop/2006
Knowledge discovery and data mining (KDD) is a multidisciplinary effort to extract nuggets of information from data. Massive data sets have become common in many applications and pose novel challenges for KDD. Along with changes in size, the context of these data runs from the loose structure of text and images and to designs of microarray experiments. Research in computer science, engineering, and statistics confront similar issues in feature selection, and we see a pressing need for and benefits in the interdisciplinary exchange and discussion of ideas. We anticipate that our collaborations will shed light on research directions and provide the stimulus for creative breakthroughs.
This workshop will bring together researchers from different disciplines and encourage collaborative research in feature selection. Feature selection is an essential step in successful data mining applications. Feature selection has practical significance in many areas such as statistics, pattern recognition, machine learning, and data mining. The objectives of feature selection include: building simpler and more comprehensible models, improving data mining performance, and helping to prepare, clean, and understand data.
Submissions that consider knowledge in feature selection will receive special consideration. Knowledge here means some declarative knowledge that can be explicitly expressed by a domain expert such as constraints. One form of using knowledge is semi-supervised learning. The semi-supervised situation remains prevalent, even in the presence of massive data sets. The high expense of “marking documents” leads to situations in which one has massive data describing the feature space, but relatively little describing the relationship between features and the response. We encourage presentations featuring both the theory behind feature selection as well as novel applications to data. Additional workshop topics include the following.
- Dimensionality reduction (feature ranking, subset selection, feature extraction) - Feature construction - Improving data mining performance - Novel data structures - Streaming data reduction and time series - Selection for labeled and unlabeled data - Modeling variable and feature selection - Goodness measures and evaluation (e.g., false discovery rates) - Ensemble methods - Selection bias - Sampling methods - Selection with small samples - Cross-discipline comparative studies (microarray, text, Web) - Integration with data mining algorithms - Real-world case studies and applications - Emerging challenges (e.g., survival analysis, connecting selection and causality, knowledge in feature selection)
Paper Format, Important Dates, and Submission o A paper (maximum 8 pages in single column, no smaller than 11 pt) should be submitted in PDF or WORD format o Submissions should be emailed to featureselection@gmail.com o Quality short papers, position papers are also welcome o The deadline for submission: January 9, Monday. o Acceptance notification: February 1, Wednesday o Camera ready due: February, 14, Tuesday o The accepted papers will be published in the workshop proceedings. o Accepted papers will be considered for a special issue in a prestigious journal.
More information can be found at the workshop website http://enpub.eas.asu.edu/workshop/2006.
--------------------------------------------------------------------- To unsubscribe, e-mail: computational.science-unsubscribe@lists.optimanumerics.com For additional commands, e-mail: computational.science-help@lists.optimanumerics.com
Computational Science mailing list hosting is provided by OptimaNumerics (http://www.OptimaNumerics.com) ---------------------------------------------------------------------