HyperAIHyperAI

Command Palette

Search for a command to run...

5 months ago

CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation

CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding
  and Generation

Abstract

Benchmark datasets have a significant impact on accelerating research inprogramming language tasks. In this paper, we introduce CodeXGLUE, a benchmarkdataset to foster machine learning research for program understanding andgeneration. CodeXGLUE includes a collection of 10 tasks across 14 datasets anda platform for model evaluation and comparison. CodeXGLUE also features threebaseline systems, including the BERT-style, GPT-style, and Encoder-Decodermodels, to make it easy for researchers to use the platform. The availabilityof such data and baselines can help the development and validation of newmethods that can be applied to various program understanding and generationproblems.

Code Repositories

Avmb/semantic_neq_game
Mentioned in GitHub
yueyuel/programgen-lms-reliability
pytorch
Mentioned in GitHub
facebookresearch/CodeGen
Mentioned in GitHub
deeplearnxmu/unigencoder
pytorch
Mentioned in GitHub
microsoft/CodeXGLUE
Official
pytorch
Mentioned in GitHub
sberbank-ai/fusion_brain_aij2021
pytorch
Mentioned in GitHub

Benchmarks

BenchmarkMethodologyMetrics
cloze-test-on-codexglue-ct-allCodeBERT(MLM)
Go: 83.31
JS: 81.77
Java: 80.63
PHP: 85.05
Python: 87.21
Ruby: 80.17
cloze-test-on-codexglue-ct-maxminCodeBERT(MLM)
Go: 90.79
JS: 86.4
Java: 90.46
PHP: 88.21
Python: 82.2
Ruby: 86.84
code-repair-on-codexglue-bugs2fixCodeBERT
Accuracy (medium): 5.2
Accuracy (small): 16.4
BLEU (medium): 91.07
BLEU (small): 77.42
CodeBLEU (medium): 87.52
CodeBLEU (small): 75.58
code-search-on-codexglue-advtestCodeBERT
MRR: 27.19
code-search-on-codexglue-webquerytestCodeBERT
Accuracy: 47.8
F1: 58.95
code-translation-on-codexglue-codetransCodeBERT
Accuracy (C#→Java): 58
Accuracy (Java→C#): 59
BLEU (C#→Java): 72.14
BLEU (Java→C#): 79.92
CodeBLEU (C#→Java): 79.41
CodeBLEU (Java→C#): 85.1
defect-detection-on-codexglue-devignCodeBERT
Accuracy: 62.08
text-to-code-generation-on-codexglue-concodeCodeGPT-adapted
BLEU: 32.79
CodeBLEU: 27.74
EM: 20.1

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation | Papers | HyperAI