Life is full of Serendipity.
(In Korean: ÀλýÀ̶õ ¾ÆÁÖ ¸Õ ¿¾³¯¿¡ ½ÅÀÌ ³ª¸¦ À§Çؼ ¹Ì¸® ÁغñÇØ ³õÀº ¼ÒÁßÇÏ°í °ªÁø °ÍµéÀ» Çϳª¾¿ ã¾Æ °¡´Â ±â»Ý.)
-- Kyuseok Shim (In the year of 1999)--
Kyuseok Shim
Department of Electrical and Computer Engineering
Seoul National University
Kwanak P.O. Box 34
SEOUL 151-742, KOREA
Office (SNU): Building 302 - Room 531
Telephone: 82-2-880-7269 (Office) , 82-2-880-1758 (KDD Lab)
Fax: 82-2-871-5974
Email: kshim[AT]snu[DOT]ac[DOT]kr
Do your best and God will do the rest!
I am a Professor at
Department of Electrical and Computer Engineering
of Seoul National University, Korea.
I am also leading the
Knowledge Discovery and Database Research Laboratory at
Seoul National University.
Before that, I was an Assistant Professor at
Computer Science Department of
KAIST (Korea),
a member of technical staff (MTS) at Bell Laboratories (Murray Hill)
and a research staff at IBM Almaden Research Center (San Jose).
In Bell Laboratories, I was a key contributor with Rajeev Rastogi to
the Serendip data mining project and, in IBM Almaden Research Center, I was
a member of Quest Data Mining project led by Rakesh Agrawal.
I have visited Data Management, Exploration & Mining group in Microsoft Research
as a visiting scientist for the summer/winter breaks of 2001 and 2002.
I also worked as a summer intern mentored by Surajit Chaudhuri at Hewlett-Packard Laboratories for two summers in 1992 and 1993.
I received a Ph.D under the supervision of Professor Timos Sellis
in Computer Science from University of Maryland at College Park in 1993.
I received B.S. degree in Electrical Engineering from Seoul National University in 1986, and MS degree in
Computer Science from University of
Maryland at College Park in 1988.
My graduate study was supported by
University of Maryland at College Park and Korean Government Overseas Full Merit Scholarship.
I am currently an Editor-In-Chief of the VLDB Journal and was previously an Associate Editor for the IEEE TKDE, VLDB as well as PVLDB journals. I also served as a Program Co-chair for PAKDD 2003, WWW 2014, ICDE 2015, APWeb 2016, BigComp 2019 and ICDM 2019 conferences and have been serving on Program Committees of the leading database and data mining conferences including SIGMOD, SIGKDD, ICDE, ICDM, EDBT, VLDB, WWW and CIKM. I became an ACM fellow and an IEEE fellow for the contributions to scalable data mining and query processing in 2013 and 2019 respectively. I was a member of the VLDB Endowment Board of Trustees and am currently a steering committee member of PAKDD as well as DASFAA conferences. I also served as the president of the Korean Institute of Information Scientist and Engineers (KIISE) in 2022 and became a member of National Academy of Engineering of Korea in 2023. I have been working in the area of data mining, machine learning, privacy preservation, query processing, query optimization, data warehousing, semi-structured data (XML), stream data and histograms.
Research Interests
- Data Mining and Knowledge Discovery
- Query Processing and Optimization
- MapReduce Algorithms for Big Data Analysis
- Histograms and Wavelet Synopsis
- XML and Semi-structured Data
- Internet Stream Data
- Data Warehousing and OLAP
- Data Privacy and Security
- Embedded DBMS
Awards and Honors
- Member of National Academy of Engineering of Korea, 2023
- Korean Institute of Information Scientists and Engineers' Gaheon Award, 2019
- IEEE Fellow (For contributions to scalable data mining and database query processing), 2019
- Associate Member of National Academy of Engineering of Korea, 2019
-
ACM Fellow (For contributions to scalable data mining and query processing), 2013
- Best Teacher Award (For teaching Data Structure and Algorithm Class from Seoul National University), 2004
- Best Teacher Award (For teaching Data Structure and Algorithm Class from Seoul National University), 2003
- Korean Government Overseas Full Merit Scholarship (For Ph.D. studies from Korean Government), 1990 -- 1993
- Magna Cum Laude (For B.E. Degree from Seoul National University), 1986
- Minister of Education Award (For Software Contest from Ministry of Science and Technology, Korea), 1985
- Minister of Trading Award (For Software Contest from Ministry of Science and Technology, Korea), 1985
Work Experiences
- KAIST, Taejon, Korea (Feb. 1999 -- Feb. 2002)
- Microsoft Research, Redmond, WA (Jan., Feb., July and Aug. of 2001, Jan., Feb., June, July and Aug. of 2002)
- Bell Laboratories, Murray Hill, NJ (March 1996 -- February 2000, June -- August 2000).
- IBM Almaden Research Center, San Jose, CA (November 1994 -- March 1996)
- Federal Reserve Board, Washington, DC (September 1993 -- November 1994)
- Hewlett-Packard Laboratories, Palo Alto, CA (Summers of 1992 and 1993)
Projects
- Serendip Data Mining Project (1996--2000)
- Cosmos Constraint-based Database System Project (1998--1999)
- IBM Quest Data Mining Project (1994--1996)
- HP Smallbase Project (Summer of 1993)
- HP Papyrus Project (Summer of 1992)
Degrees
- Ph.D. in Computer Science, University of Maryland, College Park, 1993.
- M.S. in Computer Science, University of Maryland, College Park, 1988.
- B.E. (Ranked Top) in Electrical Engineering, Seoul National University, Seoul, Korea, 1986.
Tutorial Talks
- "MapReduce Algorithms for Big Data Analysis", VLDB Conference, 2012
- "Offline and Stream algorithms for efficient computation of synopsis structures", VLDB Conference, 2005
- "Analyzing and Mining Data Streams", PAKDD Conference, 2003
- "Storage and Retrieval of XML Data using Relational Databases", IEEE ICDE Conference, 2003
- "Storage and Retrieval of XML Data using Relational Databases", PAKDD Conference, 2002
- "Storage and Retrieval of XML data using Relational Databases", VLDB Conference, 2001
- "Recent Advances in Data Mining Algorithms for Large Databases", PAKDD Conference, 2001
- "Recent Advances in Data Mining Algorithms on Large Databases", CIKM Conference, 1999
- "Data Mining on Large Databases", IEEE ICDE Conference, 1999
- "Scalable Algorithms for Mining Databases", ACM SIGKDD 1999
Publication Citations
Recent Professional Activities
- 2022~ VLDB Journal: Editor-In-Chief
- 2026 EDBT Conference: Program Committee Member
- 2025 ACM SIGMOD Conference: Program Committee Member
- 2025 AAAI Conference: Program Committee Member
- 2025 EDBT Conference: Senior Program Committee Member
- 2025 ACM SIGKDD Conference: Area Chair and Lecture-Style Tutorial Chair
- 2025 WSDM Conference: Program Committee Member
- 2025 DASFAA Conference: General Chair
- 2024 ACM SIGKDD Conference: Area Chair
- 2024 ACM CIKM Conference: Senior Program Committee Member
- 2024 PAKDD Conference: Senior Program Committee Member
- 2024 DASFAA Conference: Senior Program Committee Member
- 2024 The Web Conference: Seoul Test of Time Award Committee
- 2024 WSDM Conference: Program Committee Member
- 2024 AAAI Conference: Program Committee Member
- 2024 VLDB Conference: Associate Editor
- 2024 EDBT Conference: Senior Program Committee Member
- 2024 ACM SIGMOD Conference: Program Committee Member
- 2024 APWeb-WAIM Conference: General Co-Chair
- 2024 TAAI Conference: Keynote Speaker
- 2024 IEEE International Conference on Big Data (IEEE BigData 2024): Best Paper Award Committee Member
- 2023 IEEE CS Fellow Evaluating Committee
- 2023 IEEE BigComp Conference: Best paper Award Committee Chair
- 2023 The Web Conference: Seoul Test of Time Award Committee
- 2023 PAKDD Conference: Best paper Award Committee Member
- 2023 VLDB Conference: Associate Editor
- 2023 WSDM Conference: Program Committee Member
- 2023 ACM SIGKDD Conference: Area Chair
- 2023 IEEE International Conference on Data Engineering (ICDE'23): Area Chair (Research Track)
- 2023 ACM SIGIR Conference: Program Committee Member
- 2023 DASFAA Conference: Keynote Speaker and Senior Program Committee Member
- 2023 International Conference on Advanced Data Mining and Applications (ADMA'23): Keynote Speaker
- 2022 Korean Institute of Information Scientists and Engineers: The President
- 2022 ACM CIKM Conference (CIKM'22): Senior Program Committee Member
- 2022 ACM SIGKDD Conference: Program Committee Member (Applied Data Science Track)
- 2022 ACM SIGKDD Conference: Best Paper Award Committee Member
- 2022 International Joint Conference on Artificial Intelligence (IJCAI-ECAI 22): Senior Program Committee (SPC) member
- 2022 ACM SIGIR Conference: Program Committee Member
- 2022 WSDM Conference: Program Committee Member
- 2022 ACM SIGMOD Conference: Program Committee Member
- 2022 IEEE International Conference on Data Engineering (ICDE'22): Area Chair (Research Track)
- 2021 IEEE International Conference on Data Mining (ICDM'21): Area Chair
- 2021 ACM SIGKDD Conference: Senior Program Committee Member
- 2021 ACM SIGIR Conference: Program Committee Member
- 2021 ACM CIKM Conference (CIKM'21): Program Committee Member
- 2021 WSDM Conference: Program Committee Member
- 2021 International Joint Conference on Artificial Intelligence (IJCAI'21): Area Chair
- 2021 IEEE International Conference on Data Engineering (ICDE'21): Sponsorship Co-Chair and Program Committee Member (Demonstrations Track)
- 2021 VLDB Conference: Workshop Co-Chair and Industrail Program Committee Member
- 2021 DASFAA Conference: Senior Program Committee Member
- 2020 ACM SIGMOD Conference: Program Committee Member
- 2020 ACM SIGKDD Conference: Senior Program Committee Member
- 2020 ACM CIKM Conference (CIKM'20): Program Committee Member
- 2020 ACM SIGIR Conference: Program Committee Member
- 2020 VLDB Conference: Program Committee Member
- 2020 IEEE International Conference on Data Mining (ICDM'20): Area Chair
- 2020 IEEE International Conference on Data Engineering (ICDE'20): Program Committee Member (Industry Track)
- 2020 DASFAA Conference: Senior Program Committee Member
- 2020 Asia Pacific Web and Web-Age Information Management Joint Conference on Web and Big Data (APWEB-WAIM'20): Program Committee Member
- 2020 IEEE International Conference on Data Science and Advanced Analytics (DSAA-2020): Senior Program Committee Member
- 2019 ACM CIKM Conference (CIKM'19): Program Committee Member
- 2019 VLDB Conference: Research Track Associate Editor
- 2019 ACM SIGKDD Conference: Senior Program Committee Member
- 2019 The Web Conference: Program Committee Member (Web Mining and Content Analysis Track)
- 2019 IEEE International Conference on Data Mining (ICDM'19): Program Committee Co-Chair
- 2019 PAKDD Conference: Senior Program Committee Member
- 2019 DASFAA Conference: Program Committee Member
- 2019 Asia Pacific Web and Web-Age Information Management Joint Conference on Web and Big Data (APWEB-WAIM'19): Program Committee Member
- 2019 IEEE International Conference on Big Data and Smart Computing (BigComp'19): Program Committee Co-Chair
- 2018 IEEE International Conference on Big Data (IEEE BigData 2018): Senior Program Committee Member
- 2018 IEEE International Conference on Data Mining (ICDM'18): Program Committee Member
- 2018 ACM CIKM Conference (CIKM'18): Program Committee Member
- 2018 Asia-Pacific Web Conference (APWeb-WAIM'18): Program Committee Member
- 2018 ICBK Conference: Program Committee Member
- 2018 ACM SIGKDD Conference: Senior Program Committee Member
- 2018 World Wide Web Conference: Program Committee Member (Web Content Analysis, Semantics, and Knowledge Track)
- 2018 IEEE International Conference on Data Engineering (ICDE'18): Program Committee Member (Industry Track)
- 2018 VLDB Conference: Program Committee Member
- 2018 DASFAA Conference: Senior Program Committee Member
- 2018 PAKDD Conference: Senior Program Committee Member
- 2018 SIAM International Conference on Data Mining (SDM'18): Senior Program Committee Member
- 2017 ACM CIKM Conference (CIKM'17): Program Committee Member
- 2017 IEEE International Conference on Data Mining (ICDM'17): Program Committee Member
- 2017 PAKDD Conference: General Co-Chair
- 2017 World Wide Web Conference: Program Committee Member (Web Mining & Content Analysis Track)
- 2017 SIAM International Conference on Data Mining: Senior Program Committee Member
- 2017 Asia-Pacific Web Conference (APWeb-WAIM'17): Program Committee Member
- 2017 ICBK Conference: Program Committee Member
- 2017 IEEE International Conference on Big Data and Smart Computing (BigComp'17): Program Committee Member
- 2016 ACM SIGKDD Conference: Program Committee Member
- 2016 VLDB Conference: Research Track Associate Editor
- 2016 IEEE International Conference on Data Engineering (ICDE'16): Program Committee Member (Research Track)
- 2016 IEEE International Conference on Data Mining (ICDM'16): Program Committee Member
- 2016 World Wide Web Conference: Program Committee Member (Content Analysis Track)
- 2016 ACM CIKM Conference (CIKM'16): Program Committee Member
- 2016 Asia-Pacific Web Conference (APWeb'16): Program Committee Co-Chair
- 2016 PAKDD Conference: Senior Program Committee Member
- 2016 BigComp Conference: Program Committee Member
- 2016 DASFAA Conference: Program Committee Member
- 2016 WAIM Conference: Program Committe Member
- 2015 ACM SIGKDD Conference: Doctoral Dissertation Award Committee Chair
- 2015 ACM SIGKDD Conference: Program Committee Member
- 2015 IEEE International Conference on Data Engineering (ICDE'15): Program Committee Co-Chair (Research Track)
- 2015 IEEE International Conference on Data Mining (ICDM'15): Program Committee Member
- 2015 World Wide Web Conference: Program Committee Member (Content Analysis Track)
- 2015 PAKDD Conference: Senior Program Committee Member
- 2015 DASFAA Conference: Program Committee Member
- 2015 WAIM Conference: Program Committe Member
- 2015 VLDB Conference: Program Committee Member
- 2015 BigComp Conference: Program Committee Member
- 2015 COMAD 2005: Program Committee Member
- 2014 IEEE International Conference on Big Data (IEEE BigData 2014): Senior Program Committee Member
- 2014 World Wide Web Conference: Program Committee Co-Chair (Research Track)
- 2014 ACM SIGMOD Conference: Program Committee Member
- 2014 ACM SIGKDD Conference: Program Committee Member
- 2014 IEEE International Conference on Data Mining (ICDM'14): Program Committee Member
- 2014 VLDB Conference: Research Track Associate Editor
- 2014 APWeb Conference: Tutorial Chair
- 2014 PAKDD Conference: Senior Program Committee Member
- 2014 EDBT/ICDT 2014 workshop on MapReduce: Program Committee Member
- 2014 ASONAM Conference: Program Committee Member
- 2013 ACM SIGKDD Conference: Program Committee Member
- 2013 ACM SIGMOD Conference: Demonstration Program Committee Member
- 2013 World Wide Web Conference: Track Chair (Web Mining Track)
- 2013 ICDT Conference: Program Committee Member
- 2013 ACM CIKM Conference (CIKM'13): Senior Program Committee Member (Databases Track)
- 2013 IEEE International Conference on Data Mining (ICDM'13): Vice Chair
- 2013 IEEE International Conference on Big Data (IEEE BigData 2013): Program Committee Member
- 2013 PAKDD Conference: Senior Program Committee Member
- 2013 WAIM Conference: Workshop Co-Chair & Program Committe Member
- 2013 MDM Conference: Program Committee Member
- 2013 ASONAM Conference: Program Committee Member
- 2012 ACM SIGKDD Conference: Program Committee Member
- 2012 ACM SIGMOD Conference: Program Committee Member
- 2012 IEEE International Conference on Data Mining (ICDM'12): Program Committee Member
- 2012 IEEE International Conference on Data Engineering (ICDE'12): Program Committee Member
- 2012 VLDB Conference, Istanbul, Turkey: Program Committee Member
- 2012 ACM CIKM Conference (CIKM'02): Program Committee Member
- 2012 PAKDD Conference: Senior Program Committee Member
- 2012 DASFAA Conference, Pusan, Korea: Panel Chair
- 2011 VLDB Conference: Program Committee Member
- 2011 IEEE International Conference on Data Mining (ICDM'11): Program Committee Member
- 2011 IEEE International Conference on Data Engineering (ICDE'11): Program Committee Member (Industrial Track)
- 2011 SIAM International Conference on Data Mining (SDM'11): Program Committee Member (Industrial Track)
- 2011 ACM CIKM Conference (CIKM'01): Program Committee Member (Knowledge Management track)
- 2011 International Conference on Emerging Databases (EDB'11): Program Committee Co-Chair
- 2011 International Conference on Advanced Data Mining and Applications (ADMA'11): Vice PC Chair
- 2010 IEEE International Conference on Data Mining (ICDM'10): Program Committee Vice-Chair
- 2010 VLDB Conference: Program Committee Member
- 2010 International Conference on Extending Database Technology (EDBT'10): Program Committee Member
- 2010 World Wide Web Conference: Area Chair (Data Mining and Machine Learning Track)
- 2009 ACM SIGKDD Conference: Program Committee Member
- 2009 ACM SIGMOD Conference: Program Committee Member
- 2009 World Wide Web Conference: Program Committee Member (Data Mining Track)
- 2009 ICDT Conference: Program Committee Member
- 2009 IEEE International Conference on Data Engineering (ICDE'09): Vice-Chair (Mining Data and Knowledge Discovery)
- 2008 IEEE International Conference on Data Mining (ICDM'08): Program Committee Vice-Chair
- 2008 ACM SIGKDD Conference: Program Committee Member
- 2008 VLDB Conference: Program Committee Member
- 2008 World Wide Web Conference (WWW'08):
Program Committee Member (Data Mining Track)
- 2008 ACM SIGMOD Conference: Program Committee Member
- 2008 IEEE International Conference on Data Engineering (ICDE'08): Program Committee Member (Mining Data, Text, and the Web)
- 2008 IEEE International Conference on Cooperative Information Systems (CoopIS'08): Program Committee Member
- 2007 World Wide Web Conference (WWW'07): Deputy Chair (Data Mining)
- 2007 International Conference on Scientific and Statistical Database (SSDBM'07): Program Committee Member
- 2007 ACM SIGMOD Conference: Program Committee Member
- 2007 ACM SIGKDD Conference: Senior Program Committee Member
- 2007 IEEE International Conference on Data Engineering (ICDE'07): Vice-Chair (Mining Data, Text, and the Web)
- 2006 ACM SIGMOD Conference: Program Committee Member
- 2006 IEEE International Conference on Data Engineering (ICDE'06): Program Committee Member
- 2006 VLDB Conference: Tutorial Co-Chair & Program Committee Member
- 2006 International Conference on Extending Database Technology (EDBT'06): Program Committee Member
- 2005 ACM SIGKDD Conference: Program Committee Member
- 2005 ACM SIGMOD Conference: Program Committee Member
- 2005 IEEE International Conference on Data Engineering (ICDE'05): Program Committee Vice-Chair (Mining Data, Text and Web)
- 2005 International Conference on Database Theory (ICDT'05): Program Committee Member
- 2005 VLDB Conference Demonstration Track: Program Committee Member
- 2005 COMAD 2005: Program Committee Member
- 2005 PAKDD Conference: Workshop Chair & Program Committee Member
- 2004 IEEE International Conference on Data Mining (ICDM'04): Tutorial Chair & Program Committee Member
- 2004 ACM SIGKDD Conference: Program Committee Member
- 2004 PAKDD Conference: Program Committee Member
- 2004 VLDB Conference: Program Committee Member
- 2004 IEEE International Conference on Data Engineering (ICDE'04): Program Committee Member
- 2004 SIAM Data Mining Conference: Program Committee Member
- 2004 International Conference on Database Systems for Advanced Applications (DASFAA'04): Tutorial Co-Chair
- 2004 ACM CIKM Conference: Program Committee Member
- 2003 IEEE International Conference on Data Mining: Program Committee Member
- 2003 PAKDD Conference: Program Committee Co-Chair & Stream Data Mining Tutorial
- 2003 ACM SIGMOD Conference: Program Committee Member
- 2003 IEEE International Conference on Data Engineering (ICDE'03): XML Tutorial
- 2003 WAIM Conference: Program Committee Member
- 2003 DaWak Conference: Program Committee Member
- 2003 International Conference on IDEAL:Program Committee Member
- 2003 ACM Workshop on Data Warehousing and OLAP: Program Committee Member
- 2003 International Workshop on Web Information and Data Management (WIDM'03): Program Committee Member
- 2003 ACM SIGMOD DMKD Workshop: Program Committee Member
- 2002 IEEE International Conference on Data Mining:Vice-Chair for Database Topics
- 2002 ACM SIGKDD Conference: Program Committee Member
- 2002 SIAM Data Mining Conference: Program Committee Member
- 2002 ICDE: Program Committee Member
- 2002 VLDB Conference: Program Committee Member
- 2002 EDBT Conference: Program Committee Member
- 2002 PAKDD Conference: Program Committee Member
- 2002 ACM WIDM Workshop: Program Committee Member
- 2002 ACM Workshop on Data Warehousing and OLAP: Program Committee Member
- 2002 ACM SIGMOD DMKD Workshop: Program Committee Member
- 2002 DaWaK 2002 Conference: Program Committee Member
- 2001 VLDB Conference: Tutorial on Storage and Retrieval of XML Data using Relational DB
- 2001 ACM SIGMOD DMKD Workshop: Program Committee Member
- 2001 WECWIS Workshop: Program Committee Member
- 2001 ACM SIGKDD Conference: Program Committee Member (Research and Industrial Tracks)
- 2001 WAIM Conference: Program Committee Member
- 2001 IJCAI Workshop on Knowledge Discovery from Distributed, Dynamic, Heterogeneous, Autonomous Data and Knowledge Sources: Co-Chair
- 2001 DaMeB Workshop: Program Committee Member
- 2001 PAKDD Conference: Program Committee Member
- 2001 SIAM Data Mining Conference: Program Committee Member
- 2001 ACM Workshop on Data Warehousing and OLAP: Program Committee Member
- 2001 Data Warehousing and Knowledge Discovery Conference: Program Committee Member
- 2000 SIGKDD Explorations 2:(2): Guest Editor
- 2000 VLDB Conference: Program Committee Member
- 2000 IEEE ICDE Conference: Program Committee Member
- 2000 ACM CIKM Conference: Program Committee Member
- 2000 International Conference on Information Society (IS2000): Program Committee Member
- 2000 Data Warehousing and Knowledge Discovery Conference: Program Committee Member
- 2000 ACM SIGMOD DMKD Workshop : Program Committee Member
- 2000 Discovery Science Conference: Program Committee Member
- 2000 WECWIS Workshop: Program Committee Member
- 2000 Korean Database Conference: Program Committee Member
- 2000 Asian Computing Science Conference: Program Committee Member
- 2000 M3W3 Workshop: Program Committee Member
- 2000 PAKDD Workshop: Program Committee Member
- 2000 Multimedia Information Systems Workshop: Program Committee Member
- 2000 ACM Workshop on Data Warehousing and OLAP: Program Committee Member
- 1999 ACM SIGKDD Conference: Program Committee Member & Proceedings Chair
- 1999 ACM SIGMOD Conference: Program Committee Member
- 1999 ACM SIGMOD DMKD Workshop : Workshop Co-Chair
- 1999 ACM CIKM Conference: Program Committee Member
- 1999 ACM Workshop on Data Warehousing and OLAP: Program Committee Member
- 1999 SPIE DMKD Conference: Program Committee Member
- 1999 IEEE ICDE Conference: Tutorial on Data Mining
- 1999 CIKM Conference: Tutorial on Recent Advances in Data Mining Algorithms on Large Databases
- 1998 CIKM Conference: Tutorial on Data Mining
- 1998 ACM Workshop on Data Warehousing and OLAP: Program Committee Member
- 1998 NSF Proposal Panelist
- 1998 KDD Conference: Program Committee Member & Local Arrangement Chair
- 1997 IEEE ICDE Conference: Program Committee Member
PUBLICATIONS
[List of publications from the DBLP Bibliography Server]
- "Cardinality Estimation of Approximate Substring Queries using Deep Learning"
(with Suyong Kwon and Woohwan Jung)
VLDB 2022
- "TIDY: Publishing a Time Interval Dataset with Differential Privacy"
(with Woohwan Jung and Suyong Kwon)
To appear to IEEE Transactions on Knowledge and Data Engineering Journal, 2021
- "Dual Supervision Framework for Relation Extraction with Distant Supervision and Human Annotation"
(with Woohwan Jung)
the 28nd 28th International Conference on Computational Linguistics (COLING), Barcelona, Spain, 2020
- "T-REX: A Topic-Aware Relation Extraction Model" (Short Paper)
(with Woohwan Jung)
the 29nd ACM International Conference on Information and Knowledge Management (CIKM), 2020
- "String Joins with Synonyms"
(with Gwangho Song, Hongrae Lee, Yoonjae Park, Wooyeol Kim)
25th International Conference on Database Systems for Advanced Applications (DASFAA), 2020
- "Efficient two-dimensional Haar + synopsis construction for the maximum absolute error measure"
(with Jinhyun Kim and Jun-Ki Min)
VLDB Journal 28(5), July 2019
- "Efficient Aggregation Processing in the Presence of Duplicately Detected Objects in WSNs"
(with Jun-Ki Min and Raymond Ng)
Journal of Sensors, May 2019
- "Crowdsourced Truth Discovery in the Presence of Hierarchies for Knowledge Fusion"
(with Woohwan Jung and Younghoon Kim)
the 22nd International Conference on Extending Database Technology (EDBT), March, 2019
- "Efficient Haar+ Synopsis Construction for the Maximum Absolute Error Measure"
(with Jinhyun Kim and Jun-Ki Min)
PVLDB 11(1): 2017
- "Efficient Processing of Skyline Queries Using MapReduce"
(with Yoonjae Park and Jun-Ki Min)
IEEE Transactions on Knowledge and Data Engineering Journal, 25(9): 2017
- "Integration of graphs from different data sources using crowdsourcing"
(with Younghoon Kim and Woohwan Jung)
Information Sciences Journal, Elsevier 358: 2017
- "Processing of Probabilistic Skyline Queries Using MapReduce"
(with Yoonjae Park and Jun-Ki Min)
PVLDB 8(12): 2015
- "Aggregate query processing in the presence of duplicates in wireless sensor networks"
(with Jun-Ki Min and Raymond T. Ng)
Information Sciences Journal, Elsevier 297: 2015
- "TWINS: Efficient time-windowed in-network joins for sensor networks"
(with Jun-Ki Min and Jinhyun Kim)
Information Sciences Journal, Elsevier 263: 2014
- "Parallel Computation of Skyline and Reverse Skyline Queries Using MapReduce"
(with Yoonjae Park and Jun-Ki Min)
PVLDB 6(14): 2002-2013 (2013)
- "Efficient Processing of Substring Match Queries with Inverted Variable-length Gram Indexes"
(with Younghoon Kim, Hyoungmin Park and Kyoung-Gu Woo)
Information Sciences Journal, Elsevier 244: 2013
- "Efficient Top-k Algorithms for Approximate Substring Matching"
(with Younghoon Kim)
ACM SIGMOD International Conference on Management of Data, New York, USA, 2013
- "Parallel Top-K Similarity Join Algorithms Using MapReduce"
(with Younghoon Kim)
the 28th International Conference on IEEE Data Engineering, Washington D. C. USA, 2012
- "CATCH: A detecting algorithm for coalition attacks of hit inflation in internet advertising"
(with Chulyun Kim and Hui Miao)
Information Systems Journal, Elsevier, 36(8): 2011
- "TEXT: Automatic Template Extraction from Heterogenous Web Pages"
(with Chulyun Kim)
IEEE Transactions on Knowledge and Data Engineering Journal, 23(4): 2011
- "Similarity Join Size Estimation using Locality Sensitive Hashing"
(with Hongrae Lee and Raymond Ng)
the 37th International Conference on VLDB, Seattle, USA, 2011
- "Approximate Algorithms with Generalizing Attribute Values for
k-Anonymity "
(with Hyoungmin Park)
Information Systems Journal, Elsevier, 35(8): 2010
- "Power-Law Based Estimation of Set Similarity Join Size"
(with Hongrae Lee and Raymond Ng)
the 35th International Conference on VLDB, Lyon, France, 2009
- "Approximate Substring Selectivity Estimation"
(with Hongrae Lee and Raymond Ng)
the 12th International Conference on Extending Database Technology (EDBT), March, 2009
- "Wavelet Synopsis for Hierarchical Range Queries with Workloads"
(with Sudipto Guha and Hyoungmin Park)
VLDB Journal 17(5), August 2008
- "Extending Q-Grams to Estimate Selectivity of String Matching with Edit Distance"
(with Hongrae Lee and Raymond Ng)
the 33th International Conference on VLDB, Vienna, Austria, 2007
- "Approximate Algorithms for K-Anonymity"
(with Hyoungmin Park)
ACM SIGMOD International Conference on Management of Data, Beijing, China, 2007
- "SQUIRE: Sequential Pattern Mining with Quantities"
(with Chulyun Kim, JongHwa Lim and Raymond T. Ng)
Journal of Systems and Software, Elsevier, 80(10): 2007
- "SQUIRE: Sequential Pattern Mining with Quantities"
(with Chulyun Kim, JongHwa Lim and Raymond T. Ng)
the 20th International Conference on IEEE Data Engineering, Boston, USA, 2004
- "WALRUS: A Similarity Retrieval Algorithm for Image Databases"
(with Apostol Natsev and Rajeev Rastogi)
IEEE Transactions on Knowledge and Data Engineering Journal 16(3): 2004
- "Approximation and Streaming Algorithms for Histogram Construction Problems"
(with Sudipto Guha and Nick Koudas)
ACM Transaction on Database Systems 31:(1), March 2006
- "Storing XML (with XSD) in SQL Databases: Interplay of Logical and Physical designs"
(with Zhiyuan Chen, Surajit Chaudhuri and Yuqing Wu)
IEEE Transactions on Knowledge and Data Engineering Journal 17:(12), 2005
- "An Adaptive Path Index for XML Data using the Query Workload"
(with Chin-Wan Chung and Jun-Ki Min)
Information Systems Journal 30:(6) Elsevier, 2005
- "Storing XML (with XSD) in SQL Databases: Interplay of Logical and Physical designs "
(with Zhiyuan Chen, Surajit Chaudhuri and Yuqing Wu)
the 20th International Conference on IEEE Data Engineering, Boston, USA, 2004
- "XWAVE: Approximate Extended Wavelets for Streaming Data"
(with Sudipto Guha and Chulyun Kim)
the 30th International Conference on VLDB, Toronto, Ontario, Canada, 2004
- "REHIST: Relative Error Histogram Construction Algorithms"
(with Sudipto Guha and Jungchul Woo)
the 30th International Conference on VLDB, Toronto, Ontario, Canada, 2004
- "Optimizing Queries with Materialized Views"
(with Surajit Chaudhuri, Ravi Krishnamurthy and Spyros Potamianos)
Materialized Views (Techniques, Implementations, and Applications), Edited by Ashish Gupta and Inderpal Singh Mumick, The MIT Press, 1999
- "Parametric Query Optimization"
(with Yannis E. Ioannidis, Raymond T. Ng and Timos K. Sellis)
the 18th International Conference on VLDB, Vancouver, Canada, 1992
Miscellaneous
US Patents
- K-means clustering based data mining system and method using the same
(with Seog Park, Hanjun Goo, Woohwan Jung, Seongwoong Oh, Suyong Kwon)
United States Patent No. US-11016995. Issued on May 15, 2022
- System and method for skyline queries
(with Yoonjae Park and Jun-Ki Min)
United States Patent No. 9,977,806. Issued on May 22, 2018
- Method and apparatus for processing a query
(with Younghoon Kim, Hyoungmin Park and Kyoung-gu Woo)
United States Patent No. 9,110,973. Issued on August 18, 2015
- Method of processing query about XML data using APEX
(with Jun-Ki Min and Chin-Wan Chung)
United States Patent No. 7,260,572. Issued on August 21, 2007
- Transformation tool for mapping XML to relational database
(with Surajit Chaudhuri, Zhiyuan Chen, Yuqing Yu)
United States Patent No. 7,228,312. Issued on June 5, 2007
- Document descriptor extraction method
(with Minos N. Garofalakis, Aristides Gionis, Rajeev Rastogi, Srinivasan Seshadri)
United States Patent No. 7,080,314. Issued on July 18, 2006
- Approximate query processing using wavelets
(with Kaushik Chakrabarti, Minos N. Garofalakis, Rajeev Rastogi)
United States Patent No. 6,760,724. Issued on July 06, 2004
- Methods of imaging based on wavelet retrieval of scenes
(with Apostol Ivanov Natsev, Rajeev Rastogi)
United States Patent No. 6,751,363. Issued on June 15, 2004
- Method for identifying outliers in large data sets
(with Sridhar Ramaswamy and Rajeev Rastogi)
United States Patent No. 6,643,629. Issued on November 04, 2003
- System and method for constraint based sequential pattern mining
(with Minos N. Garofalkis and Rajeev Rastogi)
United States Patent No. 6,473,757. Issued on October 29, 2002
- Decision tree classifier with integrated building and pruning phases
(with Rajeev Rastogi)
United States Patent No. 6,247,016. Issued on June 12, 2001
- Method for mining association rules in data
(with Rajeev Rastogi)
United States Patent No. 6,185,549. Issued on February 6, 2001
- A method, apparatus and programmed medium for clustering large databases
(with Sudipto Guha and Rajeev Rastogi)
United States Patent No. 6,092,072. Issued on July 18, 2000
- A method, apparatus and programmed medium for clustering databases with categorical attributes
(with Sudipto Guha and Rajeev Rastogi)
United States Patent No. 6,049,797. Issued on April 11, 2000
- Method and system for performing spatial similarity joins on high-dimensional points
(with Rakesh Agrawal and Ramakrishnan Srikant)
United States Patent No. 5,978,794. Issued on November 2, 1999
- Technique for effectively instantiating attributes in association rules
(with Rajeev Rastogi)
United States Patent No. 5,946,683. Issued on August 31, 1999
- System and method for discovering similar time sequences in database
(with Rakesh Agrawal, King-Ip Lin and Harpreet Singh Sawhney)
United States Patent No. 5,930,789. Issued on July 27, 1999
- System and method for tightly coupling application programs with relational databases
(with Rakesh Agrawal)
United States Patent No. 5,734,885. Issued on March 31, 1998
- System and method for discovering similar time sequences in database
(with Rakesh Agrawal, King-Ip Lin and Harpreet Singh Sawhney)
United States Patent No. 5,664,174. Issued on September 2, 1997
- Method and apparatus for query optimization in a relational database system having foreign functions
(with Surajit Chaudhuri)
United States Patent No. 5,544,355. Issued on August 6, 1996