I'm a third year Ph.D student working with Prof. Felix Wu. I also get a lot of help from Prof. Norm Matloff, Hao Chen, Raju Pandey, and Michael Gertz.
Donate for China Earthquake Relief
Education
- Ph.D Candidate, Computer Science, University of California, Davis. (Advanced to candidacy in Jul. 2008)
- M.S, Electronic Engineering, Tsinghua University, Beijing, China. Advisor: Dr. Xing Li. (Received in Jan. 2005)
- B.S, Electronic Engineering, Tsinghua University, Beijing, China. (Received in Jul 2002)
Research
Currently most our research projects are under the large umbrella of Davis Social Links. I am looking at the following two issues.
- Online Social Network Dynamics
- Online Privacy and Trust management
My past research projects are listed as follows. Some of them have been transferred to Compass Search.
- Link Analysis:
- Accelerated PageRank Algorithm
- Distributed PageRank Computation
- Web Crawler:
- High performance crawling: I developed a distributed web crawler as a senior.
- Crawling policy: a) server politeness. b) high quality web page. c) crawling coverage.
- Large Scale Duplicate Document Detection: This work was conducted when I visited Microsoft Research Asia.
- Webpage Classification: Our Compass team took the first place in Chinese Web Page Categorization Competition, April 2003.
- IPv6 Web Evolution and Performance Analysis: I monitored the evolution of more than 1,000 IPv6 Web sites from 2001 to 2005. Here is our IPv6 search engine.
Publications
Journal Papers:
- Shaozhi Ye, Ji-Rong Wen, and Wei-Ying Ma. A Systematic Study on Parameter Correlations in Large Scale Duplicate Document Detection. Knowledge and Information Systems, Vol.14, No.2, pp 217-232, Feb. 2008.
- Ming Jia, Jiangtao Wen, Shaozhi Ye, and Xing Li. Error Restricted Fast MAP Decoding of VLC. IEEE Communication Letters, Vol.9, No.10, pp 909-911, Oct. 2005.
- Shaozhi Ye, Hui Liu, Yue Li, Hui Huang, and Xing Li. Development of IPv6 Networks Viewed from the Angle of Search Engine. Zhongxing Telecom Technology, Vol.40, pp 1-3, 2002. (in Chinese)
- Hui Liu, Shaozhi Ye, Hui Huang, and Xing Li. IPv6 Networks Analysis based on Search Engine. Telecommunications Science, No.3, pp 43-45, 2002. (in Chinese)
Conference and Workshop Papers:
The acceptance rates of some conferences and workshops are given (papers accepted/papers submitted).
- Daniela Oliveira, Jedidiah Crandall, Gary Wassermann, Shaozhi Ye, Felix Wu, Zhendong Su, and Frederic Chong. Bezoar: Automated Virtual Machine-based Full-System Recovery from Control-Flow Hijacking Attacks. Accepted by 2008 IEEE/IFIP Network Operations and Management Symposium (NOMS'08), 2007. (27.5%=64/233)
- Ming Jia, Shaozhi Ye, Xing Li, and Julie Dickerson. Web Site Recommendation Using HTTP Traffic. In Proceedings of the 7th IEEE International Conference on Data Mining (ICDM'07), 2007. (19.2%=101/526)
- Lerone Banks, Shaozhi Ye, Yue Huang, and S. Felix Wu. Davis Social Links: Integrating Social Networks with Internet Routing. In Proceedings of ACM SIGCOMM Workshop on Large-Scale Attack Defense (LSAD'07), 2007. (To appear)
- Shaozhi Ye, Ji-Rong Wen, and Wei-Ying Ma. A Systematic Study of Parameter Correlations in Large Scale Duplicate Document Detection. In Proceedings of the 10th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD'06), Lecture Notes in Artificial Intelligence (LNAI), vol: 3918, pp 275-284, 2006. (13.4%=67/501) [slides] (Best Student Paper Nomination)
- Liang Chen, Shaozhi Ye, and Xing Li. Template Detection for Large Scale Search Engines. In Proceedings of the 21st Annual ACM Symposium on Applied Computing (SAC'06), pp 1094-1098, April 2006. (29%=16/55)
- Yangbo Zhu, Shaozhi Ye, and Xing Li. Distributed PageRank Computation Based on Aggregation-Disaggregation Methods. In Proceedings of the 14th ACM International Conference on Information and Knowledge Management (CIKM'05) , pp 578-585, 2005. (18%=76/425)
- Yi Wang, Shaozhi Ye, and Xing Li. Understanding Current IPv6 Performance: A Measurement Study. In Proceedings of the 10th IEEE Symposium on Computers and Communications (ISCC'05), pp 71-76, 2005. (37%=147/400)
- Jingfang Xu, Shaozhi Ye, and Xing Li. Query based Chinese Phrase Extraction for Site Search. In Proceedings of the fifth international conference on Web Information Systems Engineering (WISE'04), Lecture Notes in Computer Science (LNCS) vol:3306, pp 125-134, 2004. (24%)
- Shaozhi Ye, Guohan Lu, and Xing Li. Workload-Aware Web Crawling and Server Workload Detection. In Proceedings of the second Asia-Pacific Advanced Network Research Workshop, pp 263-269, Jul 2004.
- Shaozhi Ye, Ruihua Song, Ji-Rong Wen, and Wei-Ying Ma. A Query-Dependent Duplicate Detection Approach for Large Scale Search Engines. In Proceedings of the sixth Asia Pacific Web Conference (APWeb'04), Lecture Notes in Computer Science (LNCS), vol:3007, pp48-58, 2004.
- Ji-Rong Wen, Ruihua Song, Deng Cai, Kaihua Zhu, Shipeng Yu, Shaozhi Ye, and Wei-Ying Ma. Microsoft Research Asia at the Web Track of TREC 2003. In Proceedings of the 12th Text Retrieval Conference (TREC 2003), pp 408-417, Nov, 2003.
- Yue Li, Hui Liu, Gang Zhu, Shaozhi Ye, and Xing Li. Analysis of IPv6 over Search Engine. In Proceedings of the fifth Joint AEARU Workshop on Web Technology and Computer Science. Oct 2003.
- Hui Liu, Ran Peng, Shaozhi Ye, and Xing Li. An Efficient Centroid Based Chinese Web Page Classifier. In Proceedings of the first Asia-Pacific Advanced Network Research Workshop, pp 9-14, 2003.
Technical Report:
-
Shaozhi Ye, Felix Wu, Raju Pandey, and Hao Chen. Noise Injection for Search Privacy Protection. CSE-2008-10, University of California, Davis.
-
David Waetjen, Joshua Viers, Allan Hollander, Shaozhi Ye, and James Quinn. Data Management Strategies Report. In Cosumnes Research Group: Final Report, Chapter 6: Data Management, Jun. 2006.
Note: The materials are presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders.
Awards and Honors
- Excellent Academic Performance Scholarship, First Prize, Tsinghua University, Nov. 2004. (25 out of 575)
- The eighth place at Topic Distillation task and the fifth place at Named/homepage Finding task in Web Track, TREC2003. With Microsoft Research Asia.
- The first place at Chinese Web Page Categorization Competition, in conjunction with the first Chinese Symposium on Search Engine and Web Mining, organized and sponsored by China Computer Federation, Beijing, P.R.China, April 2003. With Compass group.
Teaching Assistant
Internship
Reviewing Activities
- The 27th Conference on Computer Communications(INFOCOM'08)
- The 32nd International Conference on Very Large Data Bases(VLDB'06)
- The 2006 IEEE International Conference on Communications (ICC'06)
- The third Chinese Symposium on Search Engine and Web Mining (SEWM'05)
- The 2004 IEEE/WIC/ACM International Conference on Web Intelligence (WI'04)
- The Joint Conference of 10th Asia-Pacific Conference on Communications and fifth International Symposium on Multi-Dimensional Mobile Communications (APCC/MDMC'04)
"The greatest challenge to any thinker is stating the problem in a way that will allow a solution." -- Bertrand Russell