Zhihui Yang

No. 1159, Cailun Road, Pudong District, Shanghai, China, (201203) zhyangcs@gmail.com

Now I am a visiting scholar co-advised by Chen Li at Department of Information and Computer Science, UC, Irvine since Sep. 2018. I am currently a PhD student advised by X. Sean Wang at Computer Science, Fudan University since Sep. 2014. Before that, I received my bachelor's degree in Information Science and Engineering, Lanzhou University in July, 2014.

My research interests include large scale text data analysis, exploratory data analysis, database system

My CV is here.


Fudan University, Shanghai, China

Computer Science
Advisor: Prof. X. Sean Wang

GPA: 3.566/4

September 2014 - Present

Lanzhou University

B.S. in Information Science and Engineering; (Outstanding Graduates)
Advisor: Prof. Wei Su

GPA: 4.77/5

Undergraduate Thesis: Chem2Dot: a CML to Chemical Braille Translation Software (Excellent Undergraduate Thesis)

September 2010 - July 2014


Zhihui Yang, Jiyang Gong, et al. iExplore: Accelerating Exploratory Data Analysis by Predicting User Intention[C]International Conference on Database Systems for Advanced Applications (DASFAA). Springer, Cham, 2018: 149-165.

Zhihui Yang, Huixin Ma, et al. Finding maximal ranges with unique topics in a text database[J]. World Wide Web, 2018, 21(2): 289-310.

Huixin Ma, Zhihui Yang, et al. Answering unique topic queries with dynamic threshold[J]. World Wide Web, 2018: 1-20.

Kaiwen Zhou, Zhihui Yang, et al. Design and development of partitional topic model. Journal of Frontiers of Computer Science and Technology, 2017. doi:10.3778/j.issn.1673-9418

Lvhong Liu, Zhihui Yang, et al. Unique topic query system based on relational information extraction. In The 34th national database conference, 2017.

Lvhong Liu, Zhihui Yang, et al. Unique topic query processing on cloud. IEEE International Conference on Cyber Security and Cloud Computing, 2018


Text Data Analysis, Laboratory for Data Analytics and Security (DAS Lab)

We introduced the concept of unique topics to discover topics that appear frequently within a small range of documents in contrast to the whole range.

We also proposed a pruning-based optimization (PBO) algorithm to find the maximal ranges of the specified unique topic. The PBO algorithm reduced the time complexity from O(n^3) to O(n^2). Additionally, we further reduced the time complexity to O(n).

Based on LDA, we developed a new topic model DbLDA to utilize the commonalities inside each subset in a text database.

These works was published on WWWJ2017 and WWWJ2018

2014 - present

Exploratory Data Analysis, DAS Lab

Hubble: A Smart System for Data Exploration in Big Data Era, bridge the gap between analysts and data

iExplore: Accelerating Exploratory Data Analysis by Predicting User Intention. (i)We introduced an intention model to help the iExplore system have a comprehensive understanding of user’s intention. (ii)We also studied the convergence of the intention model to figure out the characteristic of the exploratory process.

This work was published on DASFAA2018

2016 - present

Chemical Markup Language, WME Lab

We designed a method to translate Chemical Markup Language (CML) to Braille to facilitate information accessibility.

Thisworkwas funded by Hui-Chun Chin and Tsung-Dao Lee ChineseUndergraduate Research Endowment, CURE.

My undergraduate thesis about this work was awarded Excellent Undergraduate Thesis.

2013 - 2014


Programming Language & Tools
  • c++, Java, Python
  • Hadoop, Spark, Pytorch

Awards & Certifications

  • Outstanding Ph.D. Student at Fudan University
  • Outstanding Students of Master’s Degrees at Fudan University
  • the 15th Chun-Tsung Scholar
  • Outstanding Graduate at Lanzhou University
  • Excellent Undergraduate Thesis of Lanzhou University
  • IBM University Program Academic Qualification
  • National scholarship
  • National inspirational scholarship


Gallery contains some pictures taken by me.