Page Not Found
Page not found. Your pixels are in another canvas.
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Page not found. Your pixels are in another canvas.
About me
Data nowadays are produced at an unprecedented rate; Cheap sensors, existing and synthesized datasets, the emerging internet of things, and especially social media, make the collection of vast and complicated datasets a relatively easy task. With limited time and human power, the ability to effectively harness the power of big data is a problem encountered by many companies and organizations. The project tries to ease the data understanding process by compressing and evaluating the valuable information contained in the data. Specifically, <ul class='archive__item-excerpt'> <li>Proposed a topological collapse-based unsupervised method for document summarization. The method outperforms state-of-the-art methods on standard datasets composed of scientific papers. (Published in [SPAWC’16]) </li>
</ul>
Continuous representations have been widely adopted in recommender systems where a large number of entities are represented using embedding vectors. As the cardinality of the entities increases, the embedding components can easily contain millions of parameters and become the bottleneck in both storage and inference due to large memory consumption. This work focuses on post-training 4-bit quantization on the continuous embeddings. We propose row-wise uniform quantization with greedy search and codebook-based quantization that consistently outperforms state-of-the-art quantization approaches on reducing accuracy degradation. We deploy our uniform quantization technique on a production model in Facebook and demonstrate that it can reduce the model size to only 13.89% of the single-precision version while the model quality stays neutral. (Accepted in [MLSys@NeurIPS’19])
(Figure: A framework for automatic synthesis of HPC advising tools.)
Achieving high performance on computing systems is a complicated process. It requires a deep understanding of the underlying computing systems, the architectural properties, and proper implementations to take full advantage of the computing systems. In this project, we explore novel ideas to address problems in program optimization and synthesis by leveraging the recent progress in Natural Language Processing. <ul class='archive__item-excerpt'> <li> Proposed a Natural Language Understanding-driven programming framework that automatically synthesizes code based on inputs expressed in natural language using no training examples. (Accepted in [FSE’20]) </li>
</ul>
(Figure: Reuse-centric optimization.)
As a critical link between software and computing hardware, programming system plays an essential role in ensuring the efficiency, scalability, security, and reliability of machine learning. This project examines the challenges in machine learning from the programming system perspective by developing simple yet effective reuse-centric approaches. Specifically, <ul class='archive__item-excerpt'> <li>Proposed a flexible ensemble DNN training framework for efficiently training a heterogeneous set of DNNs; achieved up to 1.97X speedups over the state-of-the-art framework that was designed for homogeneous DNN ensemble training. (Published in [MLSys’20]) </li>
</ul>
Published in In 2016 IEEE 17th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), pp. 1-5. IEEE, 2016, 2016
Download here
Published in In Proceedings of the 38th ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 33-48. ACM, 2017. (Acceptance rate: 15% (47/322)), 2017
Download here
Published in In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, p. 10. ACM, 2017. (Acceptance rate: 18% (61/327)), 2017
Download here
Published in SysML, Feb 16th, 2018, Stanford University, 2018, 2018
Published in In 2018 IEEE 34th International Conference on Data Engineering (ICDE), pp. 1224-1227. IEEE, 2018. (short paper) (Acceptance rate: 23%), 2018
Download here
Published in In Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, p. 64. IEEE, 2018. (Acceptance rate: 23%), 2018
Download here
Published in In 2019 IEEE 35th International Conference on Data Engineering (ICDE), pp. 1538-1549. IEEE, 2019. (Acceptance rate: 18%), 2019
Download here
Published in In Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation, pp. 717-730. ACM, 2019. (Acceptance rate: 27.7% (76/274)), 2019
Download here
Published in MLSys Workshop on Systems for ML @ NeurIPS, 2019, 2019
Download here
Published in In Advances in Neural Information Processing Systems, pp. 5735-5744. 2019. (Acceptance rate: 21.2% (1428/6743)), 2019
Download here
Published in MLSys'20. (Acceptance rate: 20.0% (34/170)), 2020
Download here
Published in In The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Sacramento, California, United States, November 2020. (Acceptance rate: 101/360=28%), 2020
Download here
Published in In IEEE Transactions on Parallel and Distributed Systems (TPDS), 2020, 2020
Download here
Published in In The ACM SIGPLAN 2021 International Conference on Compiler Construction, 2021, 2021
Download here
Published in Information Systems, 2021, 2021
Published in In Communications of the ACM, 2021, 2021
Download here
Published in In Proceedings of International Conference on Supercomputing, 2021. (Acceptance rate: 25% (39/157)), 2021
Download here
Published in In MCHPC'21 Workshop, 2021
Download here
Published in In ACM SIGOPS Operating Systems Review, 2021, 2021
Download here
Published in In IEEE International Conference on Data Mining, 2021 (short paper). (Acceptance rate: 20% (198/990)), 2021
Download here
Published in The 2022 International Symposium on Code Generation and Optimization (CGO), Seoul, South Korea, 2022, 2022
Download here
Published in [CrossFL 2022 Workshop @ MLSys'22](https://crossfl2022.github.io/program/), 2022
Download here
Published in [ICML 2022 AI4Science Workshop](http://ai4science.net/icml22/), 2022
Download here
Published in 1st International Conference on Automated Machine Learning, July 25-27, 2022. (Acceptance rate: 19.2%), 2022
Download here
Published in IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), Taipei, Taiwan, July 18-22, 2022. (Acceptance rate: 29%), 2022
Download here
Published in In International Conference on Very Large Data Bases, 2022, 2022
Download here
Published in 36th Conference on Neural Information Processing Systems (NeurIPS 2022), November 28, 2022. (Acceptance rate: 25.6%), 2022
Download here
Published in In IEEE Access, 2023, 2023
Download here
Published in ACM SIGPLAN International Symposium on Memory Management, 2023, 2023
Download here
Published in IEEE Transactions on Neural Networks and Learning Systems, 2023, 2023
Download here
Published in 40th International Conference on Machine Learning, Jul. 23-29, 2023, 2023
Download here
Published in 40th International Conference on Machine Learning, Jul. 23-29, 2023, 2023
Download here
Published in The 29th International Conference on Mobile Computing and Networking, Madrid, Spain, Oct. 2-6, 2023, 2023
Download here
Published in The 32nd International Conference on Parallel Architectures and Compilation Techniques, Oct. 21-25, 2023, 2023
Download here
Published in The 2023 Conference on Neural Information Processing Systems, Dec. 10-16, 2023, 2023
Download here
Published in The 2024 ACM Conference on Architectural Support for Programming Languages and Operating Systems, April 27-May 1, 2024, 2024
Download here
Published in The 2024 European Conference on Computer Systems (EuroSys), April 22-25, 2024, 2024
Download here
Published in The 33rd International Symposium on, 2024
Download here
Published in In IEEE Access, 2024, 2024
Download here
Published in The 22nd ACM International Conference on Mobile Systems, Applications, and Services (MobiSys), Tokyo, Japan, June 3-7, 2024, 2024
Download here
Published in International Conference on Multimedia Information Processing and Retrieval, August 07, 2024, 2024
Download here
Published in ACM MM '24, October 28-November 1, 2024, Melbourne, VIC, Australia, 2024
Download here
Published in NeurIPS'24 Workshop AI4Mat, Dec 15, 2024, Vancouver, 2024
Download here
Published in NeurIPS'24 Workshop MLforSys, Dec 14, 2024, Vancouver, 2024
Download here
Published in NeurIPS '24, Mon, Dec 9, 2024 – Sun, Dec 15, 2024, Vancouver, 2024
Download here
Published in NeurIPS '24, Mon, Dec 9, 2024 – Sun, Dec 15, 2024, Vancouver, 2024
Download here
Published in Extended Abstracts of the CHI Conference on Human Factors in Computing Systems, 2025, 2025
Download here
Published in The Eighth Annual Conference on Machine Learning and Systems, Santa Clara, May 12-15, 2025. (Acceptance Rate = 22% (61/271)), 2025
Download here
Published in The Eighth Annual Conference on Machine Learning and Systems, Santa Clara, May 12-15, 2025 (Acceptance Rate = 22% (61/271)), 2025
Download here
Published in 32nd IEEE Symposium on Computer Arithmetic, Jun 23-25, 2025, 2025
Download here
Published in Neurocomputing, 2025, 2025
Download here
Published in ACM Transactions on Knowledge Discovery from Data, Volume 20, Issue 2, 2025, 2025
Download here
Published in First Exploration in AI Today Workshop at ICML (EXAIT at ICML 2025), 2025
Download here
Published in Proceedings of the VLDB Endowment, 2025, 2025
Download here
Published in arXiv, 2025, 2025
Download here
Published in ACM Multimedia Systems Conference, 2026, 2026
Published in IEEE Pacific Visualization Conference, 2026, 2026
Published in The 9th Annual Conference on Machine Learning and Systems (MLSys 2026), Bellevue, WA, May 18-22, 2026, 2026
Download here
Published in ACM/IEEE International Conference on Embedded Artificial Intelligence and Sensing Systems, 2026, 2026
Graduate Seminar, UMass Amherst, , 2020
This seminar discusses cutting-edge research on the topics of machine learning for systems and systems for machine learning.