Selected Publications



"T-FSM: A Task-Based System for Massively Parallel Frequent Subgraph Pattern Mining from a Big Graph" (SIGMOD 2023)
Lyuheng Yuan, Da YAN, Wenwen Qu, Saugat Adhikari, Jalal Khalil, Cheng Long, Xiaoling Wang.


"Rethinking Graph Lottery Tickets: Graph Sparsity Matters" (ICLR 2023)
Bo Hui, Da YAN, Xiaolong Ma, Wei-Shinn Ku.


"Reinforcement Learning Enhanced Weighted Sampling for Accurate Subgraph Counting on Fully Dynamic Graph Streams" (ICDE 2023)
Kaixin Wang, Cheng Long, Da YAN, Jie Zhang, H. V. Jagadish.


"Realistic Urban Traffic Simulation with Ride-Hailing Services: A Revisit to Network Kernel Density Estimation" (SIGSPATIAL 2022) (Best Paper Candidate)
Jalal Khalil, Da YAN, Lyuheng Yuan, Mostafa Jafarzadehfadaki, Saugat Adhikari, Virginia Sisiopiku, Zhe Jiang.


"Federated Fingerprint Learning with Heterogeneous Architectures" (ICDM 2022) (One of the best-ranked papers for KAIS invitation)
Tianshi Che, Zijie Zhang, Yang Zhou, Xin Zhao, Ji Liu, Zhe Jiang, Da YAN, Ruoming Jin, Dejing Dou.


"Quantifying and Reducing Registration Uncertainty of Spatial Vector Labels on Earth Imagery" (KDD 2022)
Wenchong He, Marcus Kirby, Zhe Jiang, Yiqun Xie, Xiaowei Jia, Da YAN, Yang Zhou.


"Maximal Directed Quasi-Clique Mining" (ICDE 2022)
Guimu Guo, Da YAN, Lyuheng Yuan, Jalal Khalil, Cheng Long, Zhe Jiang, Yang Zhou.


"Distributed Task-Based Training of Tree Models" (ICDE 2022)
Da YAN, Md Mashiur Rahman Chowdhury, Guimu Guo, Jalal Khalil, Zhe Jiang, Sushil Prasad.


"Mining Order-Preserving Submatrices Under Data Uncertainty: A Possible-World Approach and Efficient Approximation Methods" (ACM TODS 2022)
Ji Cheng, Da YAN, Wenwen Qu, Xiaotian Hao, Cheng Long, Wilfred Ng, Xiaoling Wang.


"Efficient Algorithms for Maximal k-Biplex Enumeration" (SIGMOD 2022)
Kaiqiang Yu, Cheng Long, Shengxin Liu, Da YAN.


"Time-sensitive POI Recommendation by Tensor Completion with Side Information" (ICDE 2022)
Bo Hui, Da YAN, Haiquan Chen, Wei-Shinn Ku.


"Parallel Mining of Large Maximal Quasi-Cliques" (VLDB Journal 2022) [LINK]
Jalal Khalil, Da YAN, Guimu Guo, Lyuheng Yuan.


"G-thinker: A General Distributed Framework for Finding Qualified Subgraphs in a Big Graph with Load Balancing" (VLDB Journal 2022) [LINK]
Da YAN, Guimu Guo, Jalal Khalil, M. Tamer Özsu, Wei-Shinn Ku, John C.S. Lui.


"PrefixFPM: A Parallel Framework for General-Purpose Mining of Frequent and Closed Patterns" (VLDB Journal 2022) [LINK]
Da YAN, Wenwen Qu, Guimu Guo, Xiaoling Wang, Yang Zhou.


"Scalable de Novo Genome Assembly Using a Pregel-Like Graph-Parallel System" (IEEE/ACM TCBB 2021) [PDF]
Guimu Guo, Hongzhi Chen, Da YAN, James Cheng, Jake Chen, and Zechen Chong.


"TrajNet: A Trajectory-Based Deep Learning Model for Traffic Prediction" (KDD 2021)
Bo Hui, Da YAN, Haiquan Chen, Wei-Shinn Ku.


"Weakly Supervised Spatial Deep Learning based on Imperfect Training Labels with Location Errors" (KDD 2021)
Zhe Jiang, Wenchong He, Marcus Kirby, Sultan Asiri, Da YAN.


"Expressive 1-Lipschitz Neural Networks for Robust Multiple Graph Learning against Adversarial Attacks" (ICML 2021)
Xin Zhao, Zeru Zhang, Zijie Zhang, Lingfei Wu, Jiayin Jin, Yang Zhou, Ruoming Jin, Dejing Dou, Da YAN.


"EDGE: Entity-Diffusion Gaussian Ensemble for Interpretable Tweet Geolocation Prediction" (ICDE 2021) [PDF]
Bo Hui, Haiquan Chen, Da YAN, Wei-Shinn Ku.


"Scalable Mining of Maximal Quasi-Cliques: An Algorithm-System Codesign Approach" (PVLDB 2020) [PDF], [FULL REPORT]
Guimu Guo, Da YAN, M. Tamer Özsu, Zhe Jiang, Jalal Khalil.


"G-thinker: A Distributed Framework for Mining Subgraphs in a Big Graph" (ICDE 2020) [PDF], [VIDEO]
Da YAN, Guimu Guo, Md Mashiur Rahman Chowdhury, M. Tamer Özsu, Wei-Shinn Ku, John C.S. Lui.


"PrefixFPM: A Parallel Framework for General-Purpose Frequent Pattern Mining" (ICDE 2020) [PDF], [VIDEO]
Da YAN, Wenwen Qu, Guimu Guo, Xiaoling Wang.


"Spatial Classification With Limited Observations Based On Physics-Aware Structural Constraint" (AAAI 2020) [PDF]
Arpan Man Sainju, Wenchong He, Zhe Jiang, Da YAN.


"Systems and Algorithms for Massively Parallel Graph Mining (Tutorial)" (IEEE BigData 2020) [PPT], [VIDEO1], [VIDEO2]
Guimu Guo and Da YAN.


"Lightweight Fault Tolerance in Pregel-Like Systems" (ICPP 2019) [PDF], [PPT]
Da YAN, James Cheng, Hongzhi Chen, Cheng Long, Purushotham Bangalore.


"T-thinker: A Task-Centric Distributed Framework For Compute-Intensive Divide-and-Conquer Algorithms" (PPoPP 2019) [PDF]
Da YAN, Guimu Guo, Md Mashiur Rahman Chowdhury, M. Tamer Özsu, John C.S. Lui, Weida Tan.


"Mining Order-Preserving Submatrices Under Data Uncertainty: A Possible-World Approach" (ICDE 2019) [PDF]
Ji Cheng, Da YAN, Xiaotian Hao, Wilfred Ng.


"Fraction-Score: A New Support Measure for Co-location Pattern Mining" (ICDE 2019) [PDF]
Harry Kai-Ho Chan, Cheng Long, Da YAN, Raymond Chi-Wing Wong.


"GraphD: Distributed Vertex-Centric Graph Processing Beyond the Memory Limit" (TPDS 2018) [PDF]
Da YAN, Yuzhen Huang, Miao Liu, Hongzhi Chen, James Cheng, Huanhuan Wu, Chengcui Zhang.


"Scalable De Novo Genome Assembly Using Pregel" (ICDE 2018) [PDF]
Da YAN, Hongzhi Chen, James Cheng, Zhenkun Cai, Bin Shao.


"Big Graph Analytics Platforms" (Foundations and Trends in Databases 2017) [PDF]
Da YAN, Yingyi Bu, Yuanyuan Tian, Amol Deshpande.


"Architectural Implications on the Performance and Cost of Graph Analytics Systems" (SoCC 2017) [PDF]
Qizhen Zhang, Hongzhi Chen, Da YAN, James Cheng, Boon Thau Loo and Purushotham Bangalore.


"Diversified Temporal Subgraph Pattern Mining" (KDD 2016) [PDF]
Yi Yang, Da YAN, Huanhuan Wu, James Cheng, Shuigeng Zhou, John C.S. Lui.


"Big Graph Analytics Systems" (SIGMOD 2016) [PDF], [PPT]
Da YAN, Yingyi Bu, Yuanyuan Tian, Amol Deshpande, James Cheng.


"Quegel: A General-Purpose System for Querying Big Graphs" (SIGMOD 2016) [PDF]
Qizhen Zhang, Da YAN, James Cheng.


"A General-Purpose Query-Centric Framework for Querying Big Graphs" (PVLDB 2016) [PDF], [PPT]
Da YAN, James Cheng, M. Tamer Özsu, Fan Yang, Yi Lu, John C.S. Lui, Qizhen Zhang, Wilfred Ng.


"Effective Techniques for Message Reduction and Load Balancing in Distributed Graph Computation" (WWW 2015) [PDF], [PPT]
Da YAN, James Cheng, Yi Lu and Wilfred Ng.


"Efficient Processing of Optimal Meeting Point Queries in Euclidean Space and Road Networks" (KAIS 2015) [PDF]
Da YAN, Zhou Zhao and Wilfred Ng.


"Probabilistic Convex Hull Queries over Uncertain Data" (TKDE 2015) [PDF]
Da YAN, Zhou Zhao, Wilfred Ng and Steven Liu.


"Large-Scale Distributed Graph Computing Systems: An Experimental Evaluation" (PVLDB 2014) [PDF]
Yi Lu, James Cheng, Da YAN and Huanhuan Wu.


"Blogel: A Block-Centric Framework for Distributed Computation on Real-World Graphs" (PVLDB 2014) [PDF], [PPT]
Da YAN, James Cheng, Yi Lu and Wilfred Ng.


"Pregel Algorithms for Graph Connectivity Problems with Performance Guarantees" (PVLDB 2014) [PDF], [PPT]
Da YAN, James Cheng, Kai Xing, Yi Lu, Wilfred Ng and Yingyi Bu.


"Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases" (TKDE 2014) [PDF]
Zhou Zhao, Da YAN and Wilfred Ng.


"Finding Distance-Preserving Subgraphs in Large Road Networks" (ICDE 2013) (Student Travel Award) [PDF], [PPT]
Da YAN, James Cheng, Wilfred Ng and Steven Liu.


"Monochromatic and Bichromatic Reverse Nearest Neighbor Queries on Land Surfaces" (CIKM 2012) [PDF], [PPT]
Da YAN, Zhou Zhao and Wilfred Ng.


"Mining Probabilistically Frequent Sequential Patterns in Uncertain Databases" (EDBT 2012) [PDF], [PPT]
Zhou Zhao, Da YAN and Wilfred Ng.


"Efficient Algorithms for Finding Optimal Meeting Point on Road Networks" (PVLDB 2011) [PDF], [PPT]
Da YAN, Zhou Zhao and Wilfred Ng.


"Robust Ranking of Uncertain Data" (DASFAA 2011) (Best Paper Award) [PDF], [PPT]
Da YAN and Wilfred Ng.