Learning to Complete Code with Sketches - Citegraph

Paper Info

Title
Learning to Complete Code with Sketches

Abstract
Code completion is usually cast as a language modelling problem, i.e., continuing an input in a left-to-right fashion. However, in practice, some parts of the completion (e.g., string literals) may be very hard to predict, whereas subsequent parts directly follow from the context. To handle this, we instead consider the scenario of generating code completions with "holes" inserted in places where a model is uncertain. We develop Grammformer, a Transformer-based model that guides the code generation by the programming language grammar, and compare it to a variety of more standard sequence models. We train the models on code completion for C# and Python given partial code context. To evaluate models, we consider both ROUGE as well as a new metric RegexAcc that measures success of generating completions matching long outputs with as few holes as possible. In our experiments, Grammformer generates 10-50% more accurate completions compared to traditional generative models and 37-50% longer sketches compared to sketch-generating baselines trained with similar techniques.

Year	Venue	Keywords
2022	International Conference on Learning Representations (ICLR)	sketch,generative model,ml4code
DocType	Citations	PageRank
Conference	0	0.34
References	Authors
0	6

Authors (6 rows)

Cited by (0 rows)

References (0 rows)

Name	Order	Citations	PageRank
Daya Guo	1	6	4.81
Alexey Svyatkovskiy	2	0	0.34
Jian Yin	3	861	97.01
Nan Duan	4	213	45.87
Marc Brockschmidt	5	7	4.51
Miltiadis Allamanis	6	505	23.67

1