Treffer: Predicting Program Properties from 'Big Code'.

Title:
Predicting Program Properties from 'Big Code'.
Authors:
Raychev, Veselin1 veselin.raychev@inf.ethz.ch, Vechev, Martin1 martin.vechev@inf.ethz.ch, Krause, Andreas1 krausea@ethz.ch
Source:
Communications of the ACM. Mar2019, Vol. 62 Issue 3, p99-107. 9p. 1 Color Photograph, 5 Diagrams, 1 Chart, 1 Graph.
Database:
Business Source Premier

Weitere Informationen

We present a new approach for predicting program properties from large codebases (aka "Big Code"). Our approach learns a probabilistic model from "Big Code" and uses this model to predict properties of new, unseen programs. The key idea of our work is to transform the program into a representation that allows us to formulate the problem of inferring program properties as structured prediction in machine learning. This enables us to leverage powerful probabilistic models such as Conditional Random Fields (CRFs) and perform joint prediction of program properties. As an example of our approach, we built a scalable prediction engine called JSNice for solving two kinds of tasks in the context of JavaScript: predicting (syntactic) names of identifiers and predicting (semantic) type annotations of variables. Experimentally, JSNice predicts correct names for 63% of name identifiers and its type annotation predictions are correct in 81% of cases. Since its public release at http://jsnice.org, JSNice has become a popular system with hundreds of thousands of uses. By formulating the problem of inferring program properties as structured prediction, our work opens up the possibility for a range of new "Big Code" applications such as de-obfuscators, decompilers, invariant generators, and others. [ABSTRACT FROM AUTHOR]

Copyright of Communications of the ACM is the property of Association for Computing Machinery and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)