Big Code: A Brief History of Code Completion/Suggestion

Original article

大黄老鼠

2020-05-19 21:43

Models

Statistical language models (n-gram and RNN)

Code Completion with Statistical Language Models [ACM SIGPLAN Notices 2014]
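To make the n-gram idea concrete, here is a minimal sketch of count-based completion. It is a plain bigram model over tokens with a made-up toy corpus, not the method of the paper above; the function names `train_bigram` and `suggest` are hypothetical.

```python
from collections import Counter, defaultdict


def train_bigram(token_sequences):
    """Count how often each token follows each preceding token."""
    counts = defaultdict(Counter)
    for tokens in token_sequences:
        for prev, nxt in zip(tokens, tokens[1:]):
            counts[prev][nxt] += 1
    return counts


def suggest(counts, prev_token, k=3):
    """Return up to k next-token suggestions, most frequent first."""
    # .get avoids creating an empty entry for tokens never seen in training.
    followers = counts.get(prev_token, Counter())
    return [tok for tok, _ in followers.most_common(k)]


# Toy corpus, invented for illustration only.
corpus = [
    ["for", "i", "in", "range", "(", "n", ")", ":"],
    ["for", "x", "in", "items", ":"],
    ["for", "i", "in", "range", "(", "10", ")", ":"],
]

model = train_bigram(corpus)
after_in = suggest(model, "in")
after_unseen = suggest(model, "unseen")
```

A real system would use longer contexts and smoothing for unseen n-grams; the sketch only shows the "rank continuations by frequency" core.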

RNN

Toward Deep Learning Software Repositories [MSR 2015]

Decision trees

Probabilistic Model for Code with Decision Trees [OOPSLA 2016]

LSTM

Neural Code Completion [ICLR 2017]

LSTM+attention+pointer

Code Completion with Neural Attention and Pointer Networks [IJCAI 2018]
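The pointer component lets a model copy a token from the recent context instead of picking one from a fixed vocabulary, which helps with rare identifiers. Below is a rough sketch of that mixing step in pure Python. It is a simplification, not the paper's exact formulation: the `gate` value and all scores are supplied by hand here (in a trained model they would come from the network), and copy probabilities are summed per token rather than kept per position.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def pointer_mixture(vocab, vocab_scores, context_tokens, attention_scores, gate):
    """Mix a vocabulary distribution with a copy distribution over context tokens.

    gate is the weight on the vocabulary distribution, in [0, 1];
    (1 - gate) goes to copying from the context.
    """
    if not 0.0 <= gate <= 1.0:
        raise ValueError("gate must be in [0, 1]")
    vocab_probs = softmax(vocab_scores)
    copy_probs = softmax(attention_scores)
    mixed = {tok: gate * p for tok, p in zip(vocab, vocab_probs)}
    for tok, p in zip(context_tokens, copy_probs):
        mixed[tok] = mixed.get(tok, 0.0) + (1.0 - gate) * p
    return mixed


# Hypothetical values: the vocabulary model prefers <unk>, but attention
# points at an out-of-vocabulary identifier seen earlier in the file.
vocab = ["x", "y", "<unk>"]
vocab_scores = [0.0, 0.0, 2.0]
context_tokens = ["my_rare_var", "x", "my_rare_var"]
attention_scores = [1.0, 0.0, 1.0]

mixed = pointer_mixture(vocab, vocab_scores, context_tokens, attention_scores, gate=0.3)
best = max(mixed, key=mixed.get)
```

With a low gate, the copied identifier wins over `<unk>`, which is the behavior the pointer is meant to provide.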

Results on datasets

js150

Paper	Type accuracy	Value accuracy
PHOG: Probabilistic Model for Code	81.5%	74.1%
Probabilistic Model for Code with Decision Trees	83.9%	82.9%
Neural Code Completion	84.8%	76.6%
Code Completion with Neural Attention and Pointer Networks	88.6%	81.0%

Note:
Neural Code Completion reports many sets of results; the figures used here are the ones that Code Completion with Neural Attention and Pointer Networks cites for comparison.

Other predictions

Learning Programs from Noisy Data

py150

Paper	Type accuracy	Value accuracy
Probabilistic Model for Code with Decision Trees	76.3%	69.2%
Code Completion with Neural Attention and Pointer Networks	80.6%	70.1%

Self-crawled GitHub data (unreleased datasets)

Learning Python Code Suggestion with a Sparse Pointer Network

Identifier prediction (Python)

Top-K accuracy	Result
Top-1	63.21%
Top-5	82.62%
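Top-K accuracy counts a prediction as correct when the true token appears anywhere in the model's first K suggestions. A minimal sketch of the metric, with hypothetical suggestion lists that are not data from any of the papers:

```python
def top_k_accuracy(ranked_predictions, targets, k):
    """Fraction of examples whose target is among the first k ranked candidates."""
    if len(ranked_predictions) != len(targets):
        raise ValueError("ranked_predictions and targets must have the same length")
    if not targets:
        return 0.0
    hits = sum(
        1
        for candidates, target in zip(ranked_predictions, targets)
        if target in candidates[:k]
    )
    return hits / len(targets)


# Hypothetical ranked suggestions for three completion points.
ranked = [
    ["self", "cls", "args"],
    ["i", "idx", "n"],
    ["path", "name", "file"],
]
gold = ["self", "idx", "mode"]

top1 = top_k_accuracy(ranked, gold, 1)
top3 = top_k_accuracy(ranked, gold, 3)
```

Top-K accuracy can only stay the same or rise as K grows, which is why Top-5 figures are always at least as high as Top-1.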

Toward Deep Learning Software Repositories
Token prediction (Java), without using the AST

Code Completion with Statistical Language Models

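Summary

This timeline only runs through 2018, at which point Code Completion with Neural Attention and Pointer Networks is the SOTA.
Moving from traditional machine learning methods to neural network models, accuracy on py150 and js150 has improved gradually.
Traditional machine learning methods show no sign of a breakthrough for now.
On the RNN side, just about every available technique has already been tried.
How would the Transformer from NMT perform?
How would GNNs perform?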
