• No results found

Postdoctoral Researcher, Data Sciences Group

N/A
N/A
Protected

Academic year: 2021

Share "Postdoctoral Researcher, Data Sciences Group"

Copied!
7
0
0

Loading.... (view fulltext now)

Full text

(1)

Dongfang Zhao, Ph.D.

Contact Advanced Computing, Mathematics and Data Division (206) 395-9527 Pacific Northwest National Laboratory [email protected]

Seattle, Washington, United States http://tinyurl.com/zhaod

Professional Experience

IBM Almaden Research Center, San Jose, CA Summer 2015 Research Intern, Cloud Management Services Department

Supervisor: Dr. Heiko Ludwig

Argonne National Laboratory, Lemont, IL Spring 2014 Research Intern, Mathematics and Computer Science Division

Supervisor: Dr. Robert Ross

Pacific Northwest National Laboratory, Richland, WA Summer 2013 Research Intern, Data-intensive Scientific Computing Group

Supervisor: Dr. Jian Yin

Epic Systems Corporation, Madison, WI 2009 – 2011 Software Developer, EpicCare Ambulatory Team

Academic Background

Pacific Northwest National Laboratory, Seattle, WA 2015 – 2016 Postdoctoral Researcher, Data Sciences Group

Illinois Institute of Technology, Chicago, IL 2012 – 2015 Ph.D., Computer Science

Advisor: Dr. Ioan Raicu

Committee: Dr. Xian-He Sun, Dr. Zhiling Lan, Dr. Erdal Oruklu Dissertation: Big Data System Infrastructure at Extreme Scales

Emory University, Atlanta, GA 2006 – 2008

M.S., Computer Science

Katholieke Universiteit Leuven, Belgium 2003 – 2005 M.S., Statistics and Artificial Intelligence

Northeastern University, China 1999 – 2003

B.E., Computer Science and Technology

Selected Publications

[TPDS] Dongfang Zhao, Ning Liu, Dries Kimpe, Robert Ross, Xian-He Sun, and Ioan Raicu. Towards Exploring Data-Intensive Scientific Applications at Extreme Scales through Systems and Simulations. IEEE Transactions on Parallel and Distributed Systems. 14 pages, doi:10.1109/TPDS.2015.2456896, 2015.

[TSC] Dongfang Zhao, Kan Qiao, Jian Yin, and Ioan Raicu. Dynamic Virtual Chunks: On Supporting Efficient Accesses to Compressed Scientific Data. IEEE Transactions on Services Computing. 14 pages, doi:10.1109/TSC.2015.2456889, 2015.

[TPAMI] Dongfang Zhao and Li Yang. Incremental Isometric Embedding of High-Dimensional Data Using Connected Neighborhood Graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence. 31(1):86 – 98, January 2009.

(2)

Invited Talks “Big Data at Extreme Scales”

West Virginia University, Morgantown, WV April 2015 California State University, Sacramento, CA March 2015

University of Idaho, Moscow, ID February 2015

Conference Presentations

[06/2015] “High-Performance Storage Support for Scientific Applications on the Cloud”, ACM HPDC’15, Portland, OR.

[11/2014] “Storage Support for Data-Intensive Applications on Extreme-Scale HPC Systems”, ACM/IEEE SC’14, New Orleans, LA.

[10/2014] “Virtual Chunks: On Supporting Random Accesses to Scientific Data in Compressible Storage Systems”, IEEE BigData’14, Washington, DC.

[10/2014] “FusionFS: Towards Supporting Data-Intensive Scientific Applications on Extreme-Scale High-Performance Computing Systems”, IEEE BigData’14, Washington, DC.

[05/2014] “HyCache+: Towards Scalable High-Performance Caching Middleware for Parallel File Systems”, IEEE/ACM CCGrid’14, Chicago, IL.

[09/2013] “Distributed Data Provenance for Large-Scale Data-Intensive Computing”, IEEE Cluster’13, Indianapolis, IN.

[05/2013] “HyCache: A User-Level Caching Middleware for Distributed File Systems”, IEEE IPDPS’13, Boston, MA.

[04/2013] “Exploring Reliability of Exascale Systems Through Simulations”, ACM/SCS HPC’13, San Diego, CA.

[11/2012] “Distributed File Systems for Exascale Computing”, ACM/IEEE SC’12, Salt Lake City, UT.

Teaching Teaching Assistant

CS554 Data Intensive Computing, IIT Spring 2015, Fall 2013 CS553 Cloud Computing, IIT Fall 2014, Spring 2013 CS495 Introduction to Distributed Systems, IIT Fall 2012 Student Supervision

Kevin Brandstatter, BS/MS Computer Science, IIT Spring 2015 Projects: Distributed indexing in file systems

First employment: software developer, Epic, Madison, WI

Daniel Gordon, BS Computer Science, IIT Fall 2014 Projects: Student cluster competition at ACM/IEEE SC’14

First employment: software developer, Nokia, Chicago, IL

Chen Shou, MS Computer Science, IIT Fall 2013

Projects: Provenance awareness in distributed file systems Publications: IEEE Cluster’13, USENIX TaPP’13

First employment: software developer, Amazon, Seattle, WA

Kent Burlingame, BS Computer Science, IIT Spring 2013 Projects: GPU-accelerated erasure coding

(3)

First employment: software developer, Microsoft, Seattle, WA

Da Zhang, MS Computer Science, IIT Fall 2012

Projects: Application efficiency and reliability at exascale Publications: ACM/SCS HPC’13

First employment: Ph.D. program, Virginia Tech

Service Editorial Board

Springer Journal of Big Data 2015 – present

Journal Referee

IEEE Transactions on Parallel and Distributed Systems 2014 – present IEEE Transactions on Cloud Computing 2014 – present

Springer Journal of Cluster Computing 2015

Springer Journal of Grid Computing 2015

Elsevier Journal of Parallel Computing 2015

Elsevier Journal of Future Generation Computer Systems 2015 Journal of Computer Engineering and Information Technology 2015

Mathematical Problems in Engineering 2015

Conference Co-Chair

IEEE/ACM International Symposium on Big Data Computing 2015 Technical Program Committee

Int’l Conference on Data Mining, Internet Computing, and Big Data 2016 Int’l Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers 2015

The 2nd IEEE/ACM Int’l Symposium on Big Data Computing 2015 Int’l Conference on Signal Processing and Data Mining 2015 Int’l Conference on Computer Technologies Innovations & Applications 2015 Int’l Conference on Computer Vision and Pattern Recognition 2015 Int’l Conference on Education & eLearning Innovations 2015 Int’l Conference on Cloud Computing & Cryptography 2015 Int’l Conference on Engineering & Computational Innovation Sciences 2015 Int’l Conference on Advances in Computing, Communications & Informatics 2015 Int’l Conference on Cloud Computing Technologies and Applications 2015 Int’l Summit on Bio-Metrics and Smart Government 2015 IEEE Int’l Conference on Adaptive Science & Technology 2014 Reviewer

The 6th ACM Conference on Data and Application Security and Privacy 2016 The 21st IEEE International Conference on Parallel and Distributed Systems 2015 Int’l Workshop on Collaborative methodologies to Accelerate Scientific Knowledge

discovery in big data 2014

IEEE/ACM Int’l Symposium on Big Data Computing 2014 IEEE Int’l Conference on Cloud Computing Technology and Science 2014 Int’l Conference on Massive Storage Systems and Technology 2014 Int’l Workshop on Data-Intensive Scalable Computing Systems 2013 IEEE Int’l Conf. on High Performance Computing and Communications 2013

IEEE Int’l Conference on Big Data 2013

Int’l Conference on Cloud and Green Computing 2013

(4)

Assistant Professor Phone: 312-567-5704 Department of Computer Science E-mail: [email protected] Illinois Institute of Technology

Dr. Jian Yin

Senior Scientist Phone: 509-371-6398

Mathematics and Data Division E-mail: [email protected] Pacific Northwest National Laboratory

Dr. Xian-He Sun

Distinguished Professor, Fellow of the IEEE Phone: 312-567-5260 Department of Computer Science E-mail: [email protected] Illinois Institute of Technology

Dr. Robert Ross

Senior Scientist Phone: 630-252-4588

Mathematics and Computer Science Devision E-mail: [email protected] Argonne National Laboratory

Dr. Zhiling Lan

Professor Phone: 312-567-5710

Department of Computer Science E-mail: [email protected] Illinois Institute of Technology

Dr. Edward Reingold

Professor, Fellow of the ACM Phone: 312-567-3309 Department of Computer Science E-mail: [email protected] Illinois Institute of Technology

Dr. Yi Zhao

Associate Professor Phone: 614-247-7424

Department of Biomedical Engineering E-mail: [email protected] The Ohio State University

Collaborators Dr. Philip Carns, Argonne National Laboratory, Chicago, IL Dr. Linhua Jiang, Chinese Academy of Sciences, Shanghai, China Dr. Dries Kimpe, KCG Holdings Inc., Chicago, IL

Dr. Jialin Liu, Lawrence Berkeley National Laboratory, Berkley, CA Dr. Heiko Ludwig, IBM Almaden Research Center, San Jose, CA Dr. Tanu Malik, University of Chicago, IL

Dr. Kan Qiao, Google, Seattle, WA Dr. Wei Tang, Google, New York City, NY

Dr. Steven Timm, Fermi National Accelerator Laboratory, Chicago, IL Dr. Ke Wang, Intel, Portland, OR

Dr. Zhao Zhang, University of California, Berkeley, CA

Complete List of Refereed Publications

——– Under Review ——–

[33] Dongfang Zhao, Kan Qiao, Zhou Zhou, Tonglin Li, Ke Wang, Xiaobing Zhou, and Ioan Raicu. Exploiting Multi-Cores for Efficient Interchange of Large Messages in Distributed Systems. Concurrency and Computation: Practice and Experience (CCPE).

[32] Dongfang Zhao, Ke Wang, Kan Qiao, Tonglin Li, Iman Sadooghi, Xiaobing Zhou, and Ioan Raicu. Toward High-performance Key-value Stores through GPU Encoding and Locality-aware Scheduling. Journal of Parallel and Distributed Computing (JPDC).

(5)

[31] Dongfang Zhao, Kan Qiao, Zhou Zhou, Jian Yin, Tonglin Li, Ke Wang, Xiaobing Zhou, and Ioan Raicu. Toward Distributed and Hierarchical Metadata Index. IEEE IPDPS 2016.

——– 2015 ——–

[30] Iman Sadooghi, Ke Wang, Shiva Srivastava, Dharmit Patel, Dongfang Zhao, Tonglin Li, and Ioan Raicu. FaBRiQ: Leveraging Distributed Hash Tables

towards Distributed Publish-Subscribe Message Queues. IEEE/ACM International Symposium on Big Data Computing. Limassol, Cyprus, December 2015.

[29] Jian Yin and Dongfang Zhao. Data Confidentiality Challenges in Big Data Applications. Proceedings of the 2015 IEEE International Conference on Big Data. Santa Clara, CA, USA, October 2015.

[28] Xiaobing Zhou, Tonglin Li, Ke Wang, Dongfang Zhao, Iman Sadooghi, and Ioan Raicu. MHT: A Light-weight Scalable Zero-hop MPI Enabling Distributed Key-Value Store. Proceedings of the 2015 IEEE International Conference on Big Data. Santa Clara, CA, USA, October 2015.

[27] Tonglin Li, Ke Wang, Shiva Srivastava, Dongfang Zhao, Kan Qiao, Iman Sadooghi, Xiaobing Zhou, and Ioan Raicu. A Flexible QoS Fortified Distributed Key-Value Storage System for the Cloud. Proceedings of the 2015 IEEE International Conference on Big Data. Santa Clara, CA, USA, October 2015.

[26] Dongfang Zhao, Nagapramod Mandagere, Gabriel Alatorre, Mohamed Mohamed, and Heiko Ludwig. Toward Locality-aware Scheduling for Containerized Cloud Services. Proceedings of the 2015 IEEE International Conference on Big Data. Santa Clara, CA, USA, October 2015.

[25] Zhou Zhou, Xu Yang, Dongfang Zhao, Paul Rich, Wei Tang, Jia Wang, and Zhiling Lan. I/O-Aware Batch Scheduling for Petascale Computing Systems. Proceedings of the 2015 IEEE International Conference on Cluster Computing. Chicago, IL, USA, September 2015.

[24] Tonglin Li, Chaoqi Ma, Jiabao Li, Xiaobing Zhou, Ke Wang, Dongfang Zhao, and Ioan Raicu. GRAPH/Z: A Key-Value Store Based Scalable Graph Processing System. Proceedings of the 2015 IEEE International Conference on Cluster Computing. Chicago, IL, USA, September 2015.

[23] Dongfang Zhao, Kan Qiao, Jian Yin, and Ioan Raicu. Dynamic Virtual Chunks: On Supporting Efficient Accesses to Compressed Scientific Data. IEEE Transactions on Services Computing (TSC). 14 pages, doi:10.1109/TSC.2015.2456889, July 2015.

[22] Tonglin Li, Xiaobing Zhou, Ke Wang, Dongfang Zhao, Iman Sadooghi, Zhao Zhang, and Ioan Raicu. A Convergence of Key-Value Storage Systems from

Clouds to Supercomputers. Concurrency and Computation: Practice and Experience (CCPE). doi:10.1002/cpe.3614, July 2015.

[21] Dongfang Zhao, Ning Liu, Dries Kimpe, Robert Ross, Xian-He Sun, and Ioan Raicu. Towards Exploring Data-Intensive Scientific Applications at Extreme Scales through Systems and Simulations. IEEE Transactions on Parallel and Distributed Systems (TPDS). 14 pages, doi:10.1109/TPDS.2015.2456896, June 2015.

[20] Dongfang Zhao, Xu Yang, Iman Sadooghi, Gabriele Garzoglio, Steven Timm, and Ioan Raicu. High-Performance Storage Support for Scientific Applications

(6)

on the Cloud. ScienceCloud workshop, proceedings of the 24th ACM International Symposium on High Performance Distributed Computing (HPDC). Portland, OR, USA, June 2015.

[19] Tonglin Li, Kate Keahey, Ke Wang, Dongfang Zhao, and Ioan Raicu. A

Dynamically Scalable Cloud Data Infrastructure for Sensor Networks. ScienceCloud workshop, proceedings of the 24th ACM International Symposium on High

Performance Distributed Computing (HPDC). Portland, OR, USA, June 2015. [18] Dongfang Zhao, Kan Qiao, and Ioan Raicu. Towards Cost-Effective and

High-Performance Caching Middleware for Distributed Systems. International Journal of Big Data Intelligence (IJBDI). February 2015 (accepted).

——– 2014 ——–

[17] Dongfang Zhao and Ioan Raicu. Storage Support for Data-Intensive Scientific Applications on the Cloud. NSFCloud Workshop on Experimental Support for Cloud Computing. Arlington, VA, USA, December 2014.

[16] Dongfang Zhao and Ioan Raicu. Storage Support for Data-Intensive Applications on Extreme-Scale HPC Systems. The 2014 ACM/IEEE Conference on Supercomputing (SC). New Orleans, LA, USA, November 2014.

[15] Dongfang Zhao, Jian Yin, Kan Qiao, and Ioan Raicu. Virtual Chunks: On Supporting Random Accesses to Scientific Data in Compressible Storage Systems. Proceedings of the 2014 IEEE International Conference on Big Data. Washington, DC, USA, October 2014.

[14] Ke Wang, Xiaobing Zhou, Tonglin Li, Dongfang Zhao, Michael Lang, and Ioan Raicu. Optimizing Load Balancing and Data-Locality with Data-Aware Scheduling. Proceedings of the 2014 IEEE International Conference on Big Data. Washington, DC, USA, October 2014.

[13] Dongfang Zhao, Zhao Zhang, Xiaobing Zhou, Tonglin Li, Ke Wang, Dries Kimpe, Philip Carns, Robert Ross, and Ioan Raicu. FusionFS: Towards Supporting Data-Intensive Scientific Applications on Extreme-Scale High-Performance

Computing Systems. Proceedings of the 2014 IEEE International Conference on Big Data. Washington, DC, USA, October 2014.

[12] Dongfang Zhao, Kan Qiao, and Ioan Raicu. HyCache+: Towards Scalable High-Performance Caching Middleware for Parallel File Systems. Proceedings of the 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid). Chicago, IL, USA, May 2014.

——– 2013 ——–

[11] Dongfang Zhao, Jian Yin, and Ioan Raicu. Improving the I/O Throughput for Data-Intensive Scientific Applications with Efficient Compression Mechanisms. The 2013 ACM/IEEE Conference on Supercomputing (SC). Denver, CO, USA, November 2013.

[10] Dongfang Zhao, Chen Shou, Tanu Malik, and Ioan Raicu. Distributed Data Provenance for Large-Scale Data-Intensive Computing. Proceedings of the 2013 IEEE International Conference on Cluster Computing. Indianapolis, IN, USA, September 2013.

[9] Dongfang Zhao, Kent Burlingame, Corentin Debains, Pedro Alvarez-Tabio, and Ioan Raicu. Towards High-Performance and Cost-Effective Distributed Storage Systems with Information Dispersal Algorithms. Proceedings of the

(7)

2013 IEEE International Conference on Cluster Computing. Indianapolis, IN, USA, September 2013.

[8] Tonglin Li, Xiaobing Zhou, Kevin Brandstatter, Dongfang Zhao, Ke Wang, Anupam Rajendran, Zhao Zhang, and Ioan Raicu. ZHT: A Light-Weight Reliable Persistent Dynamic Scalable Zero-Hop Distributed Hash Table. Proceedings of the 27th IEEE International Parallel & Distributed Processing Symposium (IPDPS). Boston, MA, USA, May 2013.

[7] Dongfang Zhao and Ioan Raicu. HyCache: A User-Level Caching Middleware for Distributed File Systems. Proceedings of International Workshop on High Performance Data Intensive Computing, collocated with IEEE International Symposium on Parallel and Distributed Processing (IPDPS). Boston, MA, USA, May 2013.

[6] Dongfang Zhao, Da Zhang, Ke Wang, and Ioan Raicu. Exploring Reliability of Exascale Systems Through Simulations. Proceedings of the 21st ACM/SCS High Performance Computing Symposium (HPC). San Diego, CA, USA, April 2013.

[5] Chen Shou, Dongfang Zhao, Tanu Malik, and Ioan Raicu. Towards a Provenance-Aware Distributed Filesystem. The 5th Workshop on the Theory and Practice of Provenance, collocated with USENIX Symposium on Networked Systems Design and Implementation (NSDI). Lombard, IL, USA, April 2013. ——– 2012 and before ——–

[4] Dongfang Zhao and Ioan Raicu. Distributed File Systems for Exascale Computing. The 2012 ACM/IEEE Conference on Supercomputing (SC). Salt Lake City,

UT, USA, November 2012.

[3] Dongfang Zhao and Li Yang. Incremental Isometric Embedding of High-Dimensional Data Using Connected Neighborhood Graphs. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). 31(1):86 – 98, January 2009.

[2] Robin Lohfert, James Lu, and Dongfang Zhao. Solving SQL Constraints by Incremental Translation to SAT. Proceedings of the 21st International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems (IEA/AIE). Wroclaw, Poland, June 2008.

[1] Dongfang Zhao and Li Yang. Incremental Construction of Neighborhood

Graphs for Nonlinear Dimensionality Reduction, Proceedings of the 18th International Conference on Pattern Recognition (ICPR). Hong Kong, China, August 2006.

References

Related documents

2014 AAAI International Conference on Weblogs and Social Media (ICWSM); IEEE/CIC ICCC Symposium on Social Networks and Big Data (SNBD)..

Nagaveni, ― Design and Implementation of an efficient Two-level Scheduler for Cloud Computing Environment ‖ , 10th IEEE/ACM International Conference on Cluster, Cloud

Wang, "Price Trend Prediction of Stock Market Using Outlier Data Mining Algorithm," 2015 IEEE Fifth International Conference on Big Data and Cloud Computing, Dalian, 2015,

Wright, “Performance Analysis of High Performance Computing Applications on the Amazon Web Services Cloud,” in 2nd IEEE International Conference on Cloud Computing Technology

Shekhar et al., Spatial Big Data Challenges Intersecting Mobility and Cloud Computing, ACM.. SIGMOD Workshop on Data Engineering for Wireless and Mobile

Program Committee, ACM Conference on Human Factors in Computing Systems (CHI) (2013) Program Committee, ACM Virtual Reality Software and Technology (2009-2014). Program Committee,

The Second International Symposium on Cloud Computing and Big Data Challenges (ISBCC 16) solicits high quality original research papers in all aspects of Cloud Computing,

The symposium on Colossal Data Analysis and Networking (CDAN’16) solicits high quality original research papers in all aspects of Cloud Computing, Big Data,