arXiv Analytics

Sign in

arXiv:2308.07505 [cs.LG]AbstractReferencesReviewsResources

Data Race Detection Using Large Language Models

Le Chen, Xianzhong Ding, Murali Emani, Tristan Vanderbruggen, Pei-hung Lin, Chuanhua Liao

Published 2023-08-15Version 1

Large language models (LLMs) are demonstrating significant promise as an alternate strategy to facilitate analyses and optimizations of high-performance computing programs, circumventing the need for resource-intensive manual tool creation. In this paper, we explore a novel LLM-based data race detection approach combining prompting engineering and fine-tuning techniques. We create a dedicated dataset named DRB-ML, which is derived from DataRaceBench, with fine-grain labels showing the presence of data race pairs and their associated variables, line numbers, and read/write information. DRB-ML is then used to evaluate representative LLMs and fine-tune open-source ones. Our experiment shows that LLMs can be a viable approach to data race detection. However, they still cannot compete with traditional data race detection tools when we need detailed information about variable pairs causing data races.

Related articles: Most relevant | Search more
arXiv:2303.02206 [cs.LG] (Published 2023-03-03, updated 2023-08-23)
Domain Specific Question Answering Over Knowledge Graphs Using Logical Programming and Large Language Models
arXiv:2306.07567 [cs.LG] (Published 2023-06-13)
Large Language Models Sometimes Generate Purely Negatively-Reinforced Text
arXiv:2306.04634 [cs.LG] (Published 2023-06-07)
On the Reliability of Watermarks for Large Language Models