Department of Computer Science and Technology
Joined Department: 2001
Bachelor of Computer Science & Technology, Shanxi University, Taiyuan, China, 1986;
Master of Computer Science & Technology, Shanxi University, Taiyuan, China, 1989;
Ph.D in Department of Computer Science and Technology, Tsinghua University, Beijing, China, 2000
Institute of Software and Theory, Department of Computer Science & Technology: Deputy Director (2004-);
China Computer Federation: Deputy Director of Chinese Information Processing Technology Committee (2010-);
Asian Semantic Web Conference (ASWC): Steering Committee Member (2007-2009);
ASWC 2006: Local Organization Chair (2006).
Areas of Research Interests/ Research Projects
Semantic Web, Semantic Web Services
News and Social Network Mining
National Natural Science Foundation of China: Large-Scale Dynamic Ontology Matching (2010-2012);
National Natural Science Foundation of China: Semantic-based Content Management and its Application in Specific Domains (2006-2008);
National Science Foundation of China: Research on the Partition Mechanism of Ontology Granularity in Distributed Systems (2004);
National Basic Research Program of China (973 Program): Validation and Management of Requirement Models (2008-2012);
Project Funded by Xinhua News Agency: CNML (Chinese News Markup Language) Management System (2007-2008);
Project Funded by Nucleus and Radiation Safety Centre, Ministry of Environmental Protection: XML Specification and Data Exchange for Nucleus Radiation Database (2009-2012);
International Joint Project with IBM SUR: Leveraging Collective Intelligence to Mine Linkages among Data for a Smarter City (2010- 2011);
International Joint Project with IBM SUR: Large-Scale Data Integration of Ontology and Instance Matching based on Active Learning (2009-2010);
International Joint Project with IBM SUR: Smashing and Querying Distributed Databases using Semantic Web Technologies (2008-2009);
International Joint Project with IBM SUR: Key Techniques of Semantic Content Management (2006-2007);
"Tsinghua-Leuven" International Joint Project: WebInsight: Research on Modeling Correlation and Evolution of Web Documents (2009-2011).
National Natural Science Foundation of China (key project): Cloud Based Large Scale Data Mining (2011-2014)
Xinhua News Agency: Updating for CNML Management System (2010-2011)
I received the Ph.D. degree from Department of Computer Science & Technology at Tsinghua University in 2000 and have been working here since I finished my research work as a post-doctor at Department of Electronic Engineering of Tsinghua University in 2001. My research areas are Semantic Web and Semantic Web Services, and text and social network mining. Currently, my research focuses on studying key technologies in semantic content management, and applying them in the domains of news, social networks, and web services. I have published about 90 papers in international journals and conferences including SIGIR, SIGMOD, SIGKDD, ISWC, CIKM, JoDS, JoWS etc.
1. Ontology-based semantic content management. In semantic content management, ontology matching and semantic annotation are our main research topics. In ontology matching, we proposed a Bayes decision-based ontology matching framework, and further proposed a dynamic ontology matching framework to solve the aggregation problem of different matching methods for difference matching tasks. Our related work has been published in many papers such as JoWS, TKDE, and SIGMOD, and the citation number of our JoWS paper by Google Scholar is 73. In semantic annotation, we have developed three models to annotate different kinds of resources with different characteristics, including rule-based model named iASA, an SVM classification-based method, and a CRFs-based model. These models have been applied in different annotation tasks such as research profiling, stock annual report extraction, and conference annotation. The above work is funded by NSFC and National 973 Basic Research Program.
2. News and Social Network Mining. In news mining, we proposed index tree and name entity-based topic detection and tracking method, and further proposed a topic-based news exploration framework. This research is published in SIGIR 2007 and demoed in SIGKDD 2009. In social network mining, we proposed an expertise-oriented social network search framework, which has been used in a practical academic social network mining system-Arnetminer. We have also studied the problems of expert finding, conference mining, and research interest finding.
3. XML application in news domain. I am the fourth author of China Standard "Chinese News Markup Language (CNML)" (GB/T20092-2006), and the PI for the project named CNML Management System. This software has been applied in many business systems of Xinhua News Agency, such as text, image and multimedia editing systems. We have won the second prize of Wang Xuan Award for Science and Technology Achievement in News Domain.
Honors And Awards
Wang Xuan Award for Science and Technology Achievement in News Domain, Second Class (2009).
Beijing, Science and Technology Progress Award, Second Class (2017)
《Mining User Generated Content》
《Semantic Mining in Social Networks》
 Jiaxin Shi, Lei Hou, Juanzi Li, Zhiyuan Liu, Hanwang Zhang: Learning to Embed Sentences Using Attentive Recursive Trees. AAAI （2019）
 Jiaxin Shi, Chen Liang, Lei Hou, Juanzi Li, Zhiyuan Liu, Hanwang Zhang:
DeepChannel: Salience Estimation by Contrastive Learning for Extractive Document Summarization. AAAI（2019）
 Jiaxin Shi, Hanwang Zhang, Juanzi Li:Explainable and Explicit Visual Reasoning over Scene Graphs. CVPR（2019）
 Hailong Jin, Lei Hou, Juanzi Li, Tiansi Dong: Attributed and Predictive Entity Embedding for Fine-Grained Entity Typing in Knowledge Bases. COLING 2018: 282-292（2018）
 Yixin Cao, Lei Hou, Juanzi Li, Zhiyuan Liu:Neural Collective Entity Linking. COLING 2018: 675-686（2018）
 Yixin Cao, Lei Hou, Juanzi Li, Zhiyuan Liu, Chengjiang Li, Xu Chen, Tiansi Dong:Joint Representation Learning of Cross-lingual Words and Entities via Attentive Distant Supervision. EMNLP 2018: 227-237（2018）
 Xin Lv, Lei Hou, Juanzi Li, Zhiyuan Liu:Differentiating Concepts and Instances for Knowledge Graph Embedding. EMNLP 2018: 1971-1979（2018）
 Jiangtao Zhang, Juanzi Li, Xiao-Li Li, Yixin Cao, Lei Hou, Shuai Wang: Is a Common Phrase an Entity Mention or Not? Dual Representations for Domain-Specific Named Entity Recognition. DASFAA (1) 2018: 830-846（2018）
 Jing Zhang, Jie Tang, Yuanyi Zhong, Yuchen Mo, Juanzi Li, Guojie Song, Wendy Hall, Jimeng Sun:StructInf: Mining Structural Influence from Social Streams. AAAI 2017: 73-80（2017）
 Linmei Hu, Juanzi Li, Liqiang Nie, Xiaoli Li, Chao Shao: What Happens Next? Future Subevent Prediction Using Contextual Hierarchical LSTM. AAAI 2017: 3450-3456（2017）
 Liangming Pan, Chengjiang Li, Juanzi Li, Jie Tang: Prerequisite Relation Learning for Concepts in MOOCs. ACL (1) 2017: 1447-1456（2017）
 Yixin Cao, Lifu Huang, Heng Ji, Xu Chen, Juanzi Li: Bridge Text and Knowledge by Learning Multi-Prototype Entity Mention Embedding. ACL (1) 2017: 1623-1633（2017）
 Yan Zhang, Thomas Paradis, Lei Hou, Juanzi Li, Jing Zhang, Haitao Zheng: Cross-Lingual Infobox Alignment in Wikipedia Using Entity-Attribute Factor Graph. International Semantic Web Conference (1) 2017: 745-760（2017）
 Jing Zhang, Jie Tang, Cong Ma, Hanghang Tong, Yu Jing, Juanzi Li, Walter Luyten, and Marie-Francine Moens. Fast and Flexible Top-k Similarity Search on Large Networks. ACM Transactions on Information Systems (TOIS), 2017, Volume 36, Issue 2, Article No. 13. (if =1.3) [PDF]
 Linmei Hu, Bin Zhang, Lei Hou, Juanzi Li:Adaptive online event detection in news streams. Knowl.-Based Syst. 138: 105-112 (2017)
 Lei Hou, Juanzi Li, Xiao-Li Li, Jie Tang, and Xiaofei Guo. Learning to Align Comments to News Topics. ACM Transactions on Information Systems (TOIS), 2017, Volume 36, Issue 1. (if =1.3) [PDF]