ICDM 2008

IEEE International Conference on Data Mining

Pisa, Italy
15-19 December 2008

Accepted Papers

ICDM08 Regular Papers:

Paper Number Title Authors
DM202 “Unsupervised Cross-domain Learning by Interaction Information Co-clustering”
Shin Ando and Einoshin Suzuki
DM216 “Transductive Component Analysis”
Wei Liu
DM218 “Dirichlet Process Based Evolutionary Clustering”
Tianbing Xu, Zhongfei Zhang, Philip Yu, and Bo Long
DM232 “Border Sampling Through Coupling Markov Chain Monte Carlo”
Guichong Li, Nathalie Japkowicz, Trevor J. Stocki, and R. Kurt Ungar
DM237 “Non-negative Matrix Factorization on Manifold”
Deng Cai
DM258 “Clustering Uncertain Data using Voronoi Diagrams”
Ben Kao, Sau Dan Lee, David Cheung, Wai-Shing Ho, and K. F. Chan
DM273 “Formal Models for Expert Finding on DBLP Bibliography Data”
Hongbo Deng, Michael R. Lyu, and Irwin King
DM286 “Metropolis Algorithms for Representative Subgraph Sampling”
Christian Huebler, Karsten Borgwardt, Hans-Peter Kriegel, and Zoubin Ghahramani
DM299 “A Non-parametric Semi-supervised Discretization Method”
Alexis Bondu, Marc Boullé, Vincent Lemaire, Stéphane Loiseau, and Béatrice Duval
DM312 “Collaborative Filtering for Implicit Feedback Datasets”
Yehuda Koren, Yifan Hu, and Chris Volinsky
DM314 “Exploiting Local and Global Invariants for the Management of Large Scale Information Systems”
Haifeng Chen, Haibin Cheng, Guofei Jiang, and Kenji Yoshihira
DM319 “Scalable Tensor Decompositions for Multi-aspect Data Mining”
Tamara Kolda and Jimeng Sun
DM323 “One-Class Collaborative Filtering”
Rong Pan, Yunhong Zhou, Bin Cao, Nathan N. Liu, Rajan Lukose, Martin Scholz, and Qiang Yang
DM342 “Visualization of Temporal Changes in Cluster Structures using Self-Organizing Maps”
Denny Denny, Graham J. Williams, and Peter Christen
DM357 “Finding Good Itemsets by Packing Data”
Nikolaj Tatti and Jilles Vreeken
DM374 “Fast Counting of Triangles in Large Real Networks, without counting: Algorithms and Laws”
Charalampos Tsourakakis
DM379 “A Generative Probabilistic Model for Multi-Label Classification”
Hongning Wang, Minlie Huang, and Xiaoyan Zhu
DM427 “DECK: Detecting Events from Web Click-through Data”
Ling Chen, YiQun Hu, and Wolfgang Nejdl
DM441 “Learning on Weighted Hypergraphs to Integrate Protein Interactions and Gene Expressions for Cancer Outcome Prediction”
TaeHyun Hwang, Ze Tian, Jean-Pierre Kocher, and Rui Kuang
DM442 “A Fast Method to Mine Frequent Subsequences from Graph Sequence Data”
Akihiro Inokuchi and Takashi Washio
DM449 “Learning Bayesian Networks: A MAP Criterion for Joint Selection of Model Structure and Parameter”
Carsten Riggelsen
DM463 “Evolutionary Clustering by Hierarchical Dirichlet Process with Hidden Markov State”
Tianbing Xu, Zhongfei Zhang, Philip Yu, and Bo Long
DM465 “Towards Online Analytical Processing on Graphs”
Chen Chen, Xifeng Yan, Feida Zhu, Jiawei Han, and Philip S. Yu
DM469 “Measuring Proximity on Graphs with Side Information”
Hanghang Tong, Huiming Qu, and Hani Jamjoom
DM479 “Interpreting PET Scans by Structured Patient Data: A Data Mining Case Study in Dementia Research”
Andreas Hapfelmeier, Jana Schmidt, Marianne Mueller, Robert Perneczky, Alexander Drzezga, Alexander Kurz, and Stefan Kramer
DM490 “Bayesian Co-clustering”
Hanhuai Shan and Arindam Banerjee
DM509 “Inlier-based Outlier Detection via Direct Density Ratio Estimation”
Shohei Hido, Yuta Tsuboi, Hisashi Kashima, Masashi Sugiyama, and Takafumi Kanamori
DM522 “SpecVAT: Enhanced Visual Cluster Analysis”
Liang Wang, Xin Geng, James Bezdek, Christopher Leckie, and Rao Kotagiri
DM526 “TOFA: Trace Oriented Feature Analysis in Text Categorization”
Jun Yan and Qiang Yang
DM572 “Mining Order-Preserving Submatrices from Data with Repeated Measurements”
Chun Kit Chui, Ben Kao, Kevin Y. Yip, and Sau Dan Lee
DM578 “Unsupervised Face Annotation by Mining the Web”
Duy-Dinh Le and Shin’ichi Satoh
DM579 “Isolation Forest”
Fei Tony Liu, Kai Ming Ting, and Zhi-Hua Zhou
DM590 “Learning by Propagability”
Bingbing Ni, Shuicheng Yan, Loong Fah Cheong, and Ashraf Kassim
DM593 “Predicting Future Decision Trees from Evolving Data”
Mirko Boettcher, Martin Spott, and Rudolf Kruse
DM597 “Enhancing the Stability of Spectral Ordering with Sparsification and Partial Supervision: Application to Paleontological Data”
Dimitrios Mavroeidis and Ella Bingham
DM609 “Temporal-Relational Classifiers for Prediction in Evolving Domains”
Umang Sharan and Jennifer Neville
DM611 “Semi-supervised Learning from General Unlabeled Data”
Kaizhu Huang, Zenglin Xu, Irwin King, and Michael R. Lyu
DM614 “A Novel Language-Model-Based Approach for Image Object Mining and Re-Ranking”
JenHao Hsiao, Chu-Song Chen, and Ming-Syan Chen
DM619 “Maximum Margin Clustering with Pairwise Constraints”
Yang Hu, Jingdong Wang, Nenghai Yu, and Xian-Sheng Hua
DM635 “Anti-Monotonic Overlap-Graph Support Measures”
Toon Calders, Jan Ramon, and Dries Van Dyck
DM651 “Space-Efficient String Mining under Frequency Constraints”
Johannes Fischer, Veli Mäkinen, and Niko Välimäki
DM658 “A Robust Discriminative Term Weighting based Linear Discriminant Method for Text Classification”
Khurum Nazir Junejo and Asim Karim
DM676 “A Randomized Approach for Approximating the Number of Frequent Sets”
Mario Boley and Henrik Grosskreutz
DM701 “SCS: A New Similarity Measure for Categorical Sequences”
Abdellali Kelil and Shengrui Wang
DM713 “LBF: A Labeled-Based Forecasting Algorithm and its Application to Electricity Price Time Series”
Francisco Martinez, Alicia Troncoso, Jose C. Riquelme, and Jesus S. Aguilar
DM772 “Paired Learners for Concept Drift”
Steve Bach and Mark Maloof
DM776 “Balancing Spectral Clustering for Segmenting Spatio-Temporal Observations of Multi-Agent Systems”
Balint Takacs and Yiannis Demiris
DM785 “SPARCL: Efficient and Effective Shape-based Clustering”
Vineet Chaoji, Mohammad Hasan, Saeed Salem, and Mohammed J. Zaki
DM790 “Start Globally, Optimize Locally, Predict Globally: Improving Performance on Unbalanced Data”
David Cieslak and Nitesh Chawla
DM791 “DisCo: Distributed Co-clustering with Map-Reduce (A Case Study Towards Petabyte-Scale End-to-End Mining)”
Spiros Papadimitriou and Jimeng Sun
DM793 “Nonnegative Matrix Factorization for Data Mining Optimization: Clustering and Beyond”
Chris Ding and Tao Li
DM807 “xCrawl: A High-Recall Crawling Method for Web Mining”
Kostyantyn Shchekotykhin, Dietmar Jannach, and Gerhard Friedrich
DM848 “Generalized Framework for Syntax-based Relation Mining”
Bonaventura Coppola, Alessandro Moschitti, and Daniele Pighin
DM851 “M3MIML: A Maximum Margin Method for Multi-Instance Multi-Label Learning”
Min-Ling Zhang and Zhi-Hua Zhou
DM857 “On-Line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking”
Loulwah AlSumait, Carlotta Domeniconi, and Daniel Barbara
DM861 “Web mining for Understanding Stories through Graph Visualization”
Ilija Subasic and Bettina Berendt
DM866 “Mining Periodic Behavior in Dynamic Social Networks”
Mayank Lahiri and Tanya Berger-Wolf
DM880 “Predicting the Helpfulness of Online Reviews”
Yang Liu, Xiangji Huang, Aijun An, and Xiaohui Yu
DM888 “Computationally Efficient Estimators for Dimension Reductions Using Stable Random Projections”
Ping Li
DM891 “What Sperner Family Concept Class Is Easy to Be Enumerated?”
Atsuyoshi Nakamura and Mineichi Kudo
DM899 “Comparison of Cluster Representations from Partial Second- to Full Fourth-Order Cross Moments for Data Stream Clustering”
Mingzhou (Joe) Song and Lin Zhang
DM900 “Clustering Distributed Time Series in Sensor Networks”
Jie Yin and Mohamed Medhat Gaber
DM910 “Overlapping Matrix Pattern Visualization: a Hypergraph Approach”
Ruoming Jin, Yang Xiang, David Fuhry, and Feodor Dragan
DM911 “SeqStream: Mining Closed Sequential Patterns over Stream Sliding Windows”
Lei Chang, Tengjiao Wang, Dongqing Yang, and Hua Luan
DM913 “TEFE: A Time-Efficient Approach to Feature Extraction”
Li-Ping Liu, Yang Yu, Yuan Jiang, and Zhi-Hua Zhou
DM914 “Improving Collaborative Filtering Recommendations Using External Data”
Akhmed Umyarov and Alexander Tuzhilin
DM917 “Scaling Up Classifiers to Cloud Computers”
Christopher Moretti, Karsten Steinhaeuser, Douglas Thain, and Nitesh Chawla
DM930 “Supervised Inductive Learning with Lotka-Volterra derived models”
Karen Hovsepian, Peter Anselmo, and Subhasish Mazumdar
DM931 “Toward Faster Nonnegative Matrix Factorization: A New Algorithm and Comparisons”
Jingu Kim and Haesun Park
DM936 “Efficient discovery of statistically significant association rules”
Wilh Hämäläinen and Matti Nykänen

ICDM08 Short Papers:

Paper Number Title Authors
DM207 “Computational Discovery of Motifs Using Hierarchical Clustering Techniques”
Dianhui Wang and Nung Kion Lee
DM229 “Estimating Aggregates over Multiple Sets”
Edith Cohen and Haim Kaplan
DM246 “Latent Dirichlet Allocation and Singular Value Decomposition Based Multi-Document Summarization”
Rachit Arora and Ravindran Balaraman
DM260 “Maximum Margin Embedding”
Bin Zhao, Fei Wang, and Changshui Zhang
DM275 “Using Wikipedia for Co-clustering Based Cross-domain Text Classification”
Pu Wang, Carlotta Domeniconi, and Jian Hu
DM278 “Efficient Feature Selection in the Presence of Multiple Feature Classes”
Paramveer S. Dhillon, Dean Foster, and Lyle Ungar
DM284 “Frequent Subgraph Retrieval in Geometric Graph Databases”
Sebastian Nowozin and Koji Tsuda
DM285 “A Non-parametric Approach to Pair-wise Dynamic Topic Correlation Detection”
Yang Song, Lu Zhang, and C. Lee Giles
DM287 “Support Vector Regression for Censored Data (SVRc): A Novel Tool for Survival Analysis”
Faisal Khan and Valentina Bayer-Zubek
DM295 “Effective Visualization of Frequent Itemsets”
Carson Kai-Sang Leung
DM310 “Graph-based Rare Category Detection”
Jingrui He, Yan Liu, and Richard Lawrence
DM331 “Anomaly Detection Support Vector Machine and Its Application to Fault Diagnosis”
Ryohei Fujimaki
DM346 “Nearest Neighbour Classifiers for Streaming Data with Delayed Labelling”
Ludmila I. Kuncheva and J. Salvador Sanchez
DM361 “Prediction of Skin Penetration Using Machine Learning Methods”
Yi Sun, Gary Moss, Maria Prapopoulou, Rod Adams, Marc Brown, and Neil Davey
DM364 “Block-Iterative Algorithms for Non-Negative Matrix Factorization”
Suvrit Sra
DM367 “RTM: Laws and a Recursive Generator for Weighted Time-Evolving Graphs”
Leman Akoglu, Mary McGlohon, and Christos Faloutsos
DM373 “Organic pie charts”
Fabian Moerchen
DM396 “A Practical Approach to Classify Evolving Data Streams: Training with Limited Amount of Labeled Data”
Mohammad Masud, Jing Gao, Latifur Khan, Jiawei Han, and Bhavani Thuraisingham
DM398 “Why Stacked Models Perform Effective Collective Classification”
Andrew Fast and David Jensen
DM404 “TreeQA: Tree-Based Quantitative Association Analysis”
Feng Pan, Lynda Yang, Leonard McMillan, Fernando Pardo-Manuel de Villena, David Threadgill, and Wei Wang
DM436 “A Supervised Combined Feature Extraction Method for Recognition”
Tingkai Sun, Songcan Chen, Jingyu Yang, and Pengfei Shi
DM444 “A Probability Model for Projective Clustering on High Dimensional Data”
Lifei Chen, Qingshan Jiang, and Shengrui Wang
DM448 “A Topic Modeling Approach and its Integration into the Random Walk Framework for Academic Search”
Jie Tang, Ruoming Jin, and Jing Zhang
DM461 “Inference Analysis in Privacy-Preserving Data Re-publishing”
Guan Wang, Zutao Zhu, Wenliang Du, and Zhouxuan Teng
DM473 “Text Cube: Computing IR Measures for Multidimensional Text Database Analysis”
Xide Lin, Bolin Ding, Jiawei Han, Feida Zhu, and Bo Zhao
DM477 “Multi-label Classification using Ensembles of Pruned Sets”
Jesse Read, Bernhard Pfahringer, and Geoffrey Holmes
DM484 “Learning Latent Semantic Space for Effective Ranking”
Jun Yan and Shuicheng Yan
DM492 “Graph-based Iterative Hybrid Feature Selection”
Erheng Zhong, Sihong Xie, Wei Fan, Jiangtao Ren, Jing Peng, and Kun Zhang
DM504 “A Recommendation System for Preconditioned Iterative Solvers”
Thomas George, Anshul Gupta, and Vivek Sarin
DM507 “Finding Alternative Clusterings Using Constraints”
Ian Davidson and Zijie Qi
DM512 “Multiplicative Mixture Models for Overlapping Clustering”
Qiang Fu and Arindam Banerjee
DM521 “Variance Minimization Least Squares Support Vector Machines for Time Series Analysis”
Róbert Ormándi
DM528 “Publishing Sensitive Transactions for Itemset Utility”
Yabo Xu, Benjamin Fang, Ke Wang, Ada Fu, and Jian Pei
DM554 “Similarity Learning for Nearest Neighbor Classification”
Ali Mustafa Qamar, Eric Gaussier, Jean-Pierre Chevallet, and Joo Hwee Lim
DM595 “Active Learning of Equivalence Relations by Minimizing the Expected Loss Using Constraint Inference”
Steffen Rendle and Lars Schmidt-Thieme
DM610 “Multi-Space-Mapped SVMs for Multi-Class Classification”
Bo Liu, Longbing Cao, Philip S. Yu, and Chengqi Zhang
DM613 “Direct Zero-norm Optimization for Feature Selection”
Kaizhu Huang, Irwin King, and Michael R. Lyu
DM624 “Filling in the Blanks – Krimp Minimisation for Missing Data”
Jilles Vreeken and Arno Siebes
DM631 “Text Mining in Radiology Reports”
Tianxia Gong, Chew Lim Tan, Tze Yun Leong, Cheng Kiang Lee, Boon Chuan Pang, C. C. Tchoyoson Lim, Qi Tian, Suisheng Tang, and Zhuo Zhang
DM645 “RBNBC: Repeat Based Naive Bayes Classifier for Biological Sequences”
Pratibha Rani and Vikram Pudi
DM650 “Boosting Relational Sequence Alignments”
Andreas Karwath, Kristian Kersting, and Niels Landwehr
DM660 “Sequence Mining Automata: a New Technique for Mining Frequent Sequences Under Regular Expressions”
Roberto Trasarti, Francesco Bonchi, and Bart Goethals
DM663 “Alert Detection in System Logs”
Adam Oliner, Alex Aiken, and Jon Stearley
DM681 “A Joint Matrix Factorization Approach to Unsupervised Action Categorization”
Peng Cui, Fei Wang, Li-Feng Sun, and Shi-Qiang Yang
DM683 “Time Sensitive Ranking with Application to Publication Search”
Xin Li, Bing Liu, and Philip S. Yu
DM689 “A Conservative Feature Subset Selection Algorithm with Missing Data”
Alex Aussem and Sergio Rodrigues de Morais
DM693 “INSCY: indexing subspace clusters with in-process-removal of redundancy”
Ira Assent, Ralph Krieger, Emmanuel Müller, and Thomas Seidl
DM719 “Nonparametric Monotone Classification with MOCA”
Nicola Barile and Ad Feelders
DM732 “Discovering Flow Anomalies: A SWEET Approach”
James Kang, Shashi Shekhar, Chris Wennen, and Paige Novak
DM733 “Experimental Evaluation of the Value of Structure: How to Efficiently Exploit Interdependencies in Sequence Labeling”
Guillaume Wisniewski and Patrick Gallinari
DM741 “Robust Time-Referenced Segmentation of Moving Object Trajectories”
Hyunjin Yoon and Cyrus Shahabi
DM745 “Spatiotemporal Relational Probability Trees”
Amy McGovern, Nathan Hiers, Matthew Collier, David Gagne, and Rodger Brown
DM749 “Mining large networks with subgraph counting”
Ilaria Bordino, Debora Donato, Aristides Gionis, and Stefano Leonardi
DM758 “Collective Latent Dirichlet Allocation”
Zhi-Yong Shen, Jun Sun, and Yi-Dong Shen
DM773 “A Hierarchical Algorithm for Clustering Uncertain Data via an Information-Theoretic Approach”
Francesco Gullo, Gianni Ponti, Andrea Tagarelli, and Sergio Greco
DM782 “Cleansing Noisy Data Streams”
Xingquan Zhu
DM786 “Pseudolikelihood EM for Within-Network Relational Learning”
Rongjing Xiang and Jennifer Neville
DM810 “Fast and Memory Efficient Mining of High Utility Itemsets in Data Streams”
Hua-Fu Li, Hsin-Yun Huang, and Suh-Yin Lee
DM832 “Clustering Documents with Active Learning using Wikipedia”
Anna Huang, David Milne, Eibe Frank, and Ian H. Witten
DM846 “Cost-Sensitive Parsimonious Linear Regression”
Robby Goetschalckx, Scott Sanner, and Kurt Driessens
DM847 “A Shrinkage Approach for Modeling Non-Stationary Relational Autocorrelation”
Pelin Angin and Jennifer Neville
DM864 “On Locally Linear Classification by Pairwise Coupling”
Feng Chen, Chang Tien Lu, and Arnold P. Boedihardjo
DM865 “Stream Sequential Pattern Mining with Precise Error Bounds”
Luiz Mendes, Bolin Ding, and Jiawei Han
DM868 “Discovering significant patterns in multi-attribute sequences”
Robert Gwadera and Fabio Crestani
DM869 “Spotting Significant Changing Subgraphs in Evolving Graphs”
Zheng Liu, Jeffrey Yu, Yiping Ke, Xuemin Lin, and Lei Chen
DM875 “Iterative Set Expansion of Named Entities using the Web”
Richard C. Wang and William Cohen
DM886 “Sparse Maximum Margin Logistic Regression for Credit Scoring”
Sabyasachi Patra, Debasis Kundu, and Kripa Shanker
DM889 “HIREL: An Incremental Clustering Algorithm for Multi-Type Relational Datasets”
Tao Li and Sarabjot Anand
DM890 “Clustering Geospatial Objects via Hidden Markov Random Fields”
Makoto Sato and Shuuichiro Imahara
DM894 “Comparative Evaluation of Anomaly Detection Techniques for Sequence Data”
Varun Chandola, Varun Mithal, and Vipin Kumar
DM909 “Classifying High-Dimensional Text and Web Data using Very Short Patterns”
Hassan Malik and John Kender
DM921 “Document-Word Co-Regularization for Semi-supervised Sentiment Analysis”
Vikas Sindhwani and Prem Melville
DM934 “Iterative Subgraph Mining for Principal Component Analysis”
Hiroto Saigo and Koji Tsuda
DM942 “Releasing the SVM Classifier with Privacy-Preservation”
Keng-Pei Lin and Ming-Syan Chen