Publications


2024

Preference Poisoning Attacks on Reward Model Learning

Junlin Wu, Jiongxiao Wang, Chaowei Xiao, Chenguang Wang, Ning Zhang, Yevgeniy Vorobeychik.

In arXiv preprint arXiv:2402.01920 (arXiv 2024).


Measuring Vision-Language STEM Skills of Neural Models

Jianhao Shen, Ye Yuan, Srbuhi Mirzoyan, Ming Zhang, and Chenguang Wang.

In The Twelfth International Conf. on Learning Representations (ICLR 2024).


2023

Agent Instructs Large Language Models to be General Zero-shot Reasoners

Nicholas Crispino, Kyle Montgomery, Fankun Zeng, Dawn Song, Chenguang Wang.

In arXiv preprint arXiv:2310.03710 (arXiv 2023).


CodeIPPrompt: Intellectual Property Infringement Assessment of Code Language Models

Zhiyuan Yu, Yuhao Wu, Ning Zhang, Chenguang Wang, Yevgeniy Vorobeychik, Chaowei Xiao.

In Proc. of the 40th International Conf. on Machine Learning (ICML 2023).


Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study

Myeongseob Ko, Ming Jin, Chenguang Wang, Ruoxi Jia.

In International Conf. on Computer Vision (ICCV 2023).


2022

DeepStruct: Pretraining of language models for structure prediction

Chenguang Wang*, Xiao Liu*, Zui Chen*, Haoyun Hong, Jie Tang, and Dawn Song.

In Proc. 2022 Annual Meeting of the Association for Computational Linguistics (ACL 2022).


Joint language semantic and structure embedding for knowledge graph completion

Jianhao Shen, Chenguang Wang, Linyuan Gong, and Dawn Song.

In Proc. 2022 Int. Conf. on Computational Linguistics (COLING 2022).


IELM: An Open Information Extraction Benchmark for Pre-Trained Language Models

Chenguang Wang, Xiao Liu and Dawn Song.

In Proc. 2022 Conf. on Empirical Methods in Natural Language Processing (EMNLP 2022).


PALT: Parameter-Lite Transfer of Language Models for Knowledge Graph Completion

Jianhao Shen, Chenguang Wang, Ye Yuan, Jiawei Han, Heng Ji, Koushik Sen, Ming Zhang and Dawn Song.

In Proc. 2022 Conf. on Empirical Methods in Natural Language Processing (EMNLP 2022).


Benchmarking Language Models for Code Syntax Understanding

Da Shen, Xinyun Chen, Chenguang Wang, Koushik Sen and Dawn Song.

In Proc. 2022 Conf. on Empirical Methods in Natural Language Processing (EMNLP 2022).


Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models

Zhiyuan Zhang, Lingjuan Lyu, Xingjun Ma, Chenguang Wang and Xu Sun.

In Proc. 2022 Conf. on Empirical Methods in Natural Language Processing (EMNLP 2022).


Protecting intellectual property of language generation APIs with lexical watermark

Xuanli He, Qiongkai Xu, Lingjuan Lyu, Fangzhao Wu, and Chenguang Wang.

In Proc. 2022 AAAI Conf. on Artificial Intelligence (AAAI 2022).


Improving representation of the AOD to PM2.5 relationship with a convolutional neural network

Siyuan Shen, Aaron van Donkelaar, Randall V. Martin, Nathan Jacobs, and Chenguang Wang.

In Proc. 2022 Advancing Earth and Space Science (AGU 2022).


2021

Zero-shot information extraction as a unified text-to-triple translation

Chenguang Wang, Xiao Liu, Zui Chen, Haoyun Hong, Jie Tang, and Dawn Song.

In Proc. 2021 Conf. on Empirical Methods in Natural Language Processing (EMNLP 2021).


2020

Language models are open knowledge graphs

Chenguang Wang, Xiao Liu, and Dawn Song.

In arXiv preprint arXiv:2010.11967 (arXiv 2020).


GluonCV and GluonNLP: Deep learning in computer vision and natural language processing

Jian Guo, He He, Tong He, Leonard Lausen, Mu Li, Haibin Lin, Xingjian Shi, Chenguang Wang, Junyuan Xie, Aston Zhang, Hang Zhang, Zhi Zhang, Zhongyue Zhang, and Shuai Zheng.

In Journal of Machine Learning Research (JMLR 2020).


PoD: Positional dependency-based word embedding for aspect term extraction

Yichun Yin, Chenguang Wang, and Ming Zhang.

In Proc. 2020 Int. Conf. on Computational Linguistics (COLING 2020).


Transformer on a diet

Chenguang Wang, Zihao Ye, Aston Zhang, Zheng Zhang, and Alexander Smola.

In arXiv preprint arXiv:2002.06170 (arXiv 2020).


2019

Language models with Transformers

Chenguang Wang, Mu Li, and Alexander Smola.

In arXiv preprint arXiv:1904.09408 (arXiv 2019).


From shallow to deep language representations: Pre-training, fine-tuning, and beyond

Aston Zhang, Haibin Lin, Chenguang Wang, Mu Li, and Alexander Smola.

In Proc. 2019 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD 2019).


Co-occurrent features in semantic segmentation

Hang Zhang, Han Zhang, Chenguang Wang, and Junyuan Xie.

In Proc. of the IEEE Conf. on Computer Vision and Pattern Recognition (CVPR 2019).


2018

Unsupervised meta-path selection for similarity measure on heterogeneous information networks

Chenguang Wang, Yangqiu Song, Haoran Li, Ming Zhang, and Jiawei Han.

In Proc. 2018 Data Mining and Knowledge Discovery (DMKD 2018).


2017

Distant meta-path similarities for text-based heterogeneous information networks

Chenguang Wang, Yangqiu Song, Haoran Li, Yizhou Sun, Ming Zhang, and Jiawei Han.

In Proc. 2017 ACM Int. Conf. on Information and Knowledge Management (CIKM 2017).


Crowd-in-the-loop: A hybrid approach for annotating semantic roles

Chenguang Wang, Alan Akbik, Laura Chiticariu, Yunyao Li, Fei Xia, and Anbang Xu.

In Proc. 2017 Conf. on Empirical Methods in Natural Language Processing (EMNLP 2017).


Active learning for black-box semantic role labeling with neural factors

Chenguang Wang, Laura Chiticariu, and Yunyao Li.

In Proc. 2017 Int. Joint Conf. on Artificial Intelligence (IJCAI 2017).


Semi-supervised learning over heterogeneous information networks by ensemble of meta-graph guided random walks

He Jiang, Yangqiu Song, Chenguang Wang, Ming Zhang, and Yizhou Sun.

In Proc. 2017 Int. Joint Conf. on Artificial Intelligence (IJCAI 2017).


Towards re-defining relation understanding in financial domain

Chenguang Wang, Doug Burdick, Laura Chiticariu, Rajasekar Krishnamurthy, Yunyao Li, and Huaiyu Zhu.

In Proc. of 2017 ACM SIGMOD Int. Conf. on Management of Data Workshop (SIGMOD 2017 Workshop).


HINE: Heterogeneous information network embedding

Yuxin Chen and Chenguang Wang.

In Proc. 2017 Int. Conf. on Database Systems for Advanced Applications (DASFAA 2017).


2016

World knowledge as indirect supervision for document clustering

Chenguang Wang, Yangqiu Song, Ahmed El-Kishky, Dan Roth, Ming Zhang, and Jiawei Han.

In ACM Transactions on Knowledge Discovery from Data (TKDD 2016).


Chenguang Wang, Yizhou Sun, Yanglei Song, Jiawei Han, Yangqiu Song, Lidan Wang, and Ming Zhang.

In Proc. 2016 SIAM Int. Conf. on Data Mining (SDM 2016).


Text classification with heterogeneous information network kernels

Chenguang Wang, Yangqiu Song, Haoran Li, Ming Zhang, and Jiawei Han.

In Proc. 2016 AAAI Conf. on Artificial Intelligence (AAAI 2016).


2015

KnowSim: A document similarity measure on structured heterogeneous information networks

Chenguang Wang, Yangqiu Song, Haoran Li, Ming Zhang, and Jiawei Han.

In Proc. of 2015 IEEE Int. Conf. on Data Mining (ICDM 2015).


Constrained information-theoretic tripartite graph clustering to identify semantically similar relations

Chenguang Wang, Yangqiu Song, Dan Roth, Chi Wang, Jiawei Han, Heng Ji, and Ming Zhang.

In Proc. 2015 Int. Joint Conf. on Artificial Intelligence (IJCAI 2015).


Incorporating world knowledge to document clustering via heterogeneous information networks

Chenguang Wang, Yangqiu Song, Ahmed El-Kishky, Dan Roth, Ming Zhang, and Jiawei Han.

In Proc. 2015 ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining (KDD 2015).


Spectral label refinement for noisy and missing text labels

Yangqiu Song, Chenguang Wang, Ming Zhang, Hailong Sun, and Qiang Yang.

In Proc. 2015 AAAI Conf. on Artificial Intelligence (AAAI 2015).


2014

Measuring domain influence in heterogeneous networks

Quan Liu, Chenguang Wang, and Ming Zhang.

In Proc. 2014 ACM Int. Conf. on Web Search and Data Mining Workshop on Diffusion Networks and Cascade Analytics (WSDM 2014 Workshop).


2013

Chenguang Wang, Nan Duan, Ming Zhou, and Ming Zhang.

In Proc. 2013 Annual Meeting of the Association for Computational Linguistics (ACL 2013).


2011

ENGtube: An integrated subtitle environment for ESL

Chi-Ho Li, Shujie Liu, Chenguang Wang, and Ming Zhou.

In MT Summit XIII: the Thirteenth Machine Translation Summit (MTSummit 2011).