HAConvGNN: Hierarchical Attention Based Convolutional Graph Neural Network for Code Documentation Generation in Jupyter Notebooks

03/31/2021
by   Xuye Liu, et al.
19

Many data scientists use Jupyter notebook to experiment code, visualize results, and document rationales or interpretations. The code documentation generation CDG task in notebooks is related but different from the code summarization task in software engineering, as one documentation (markdown cell) may consist of a text (informative summary or indicative rationale) for multiple code cells. Our work aims to solve the CDG task by encoding the multiple code cells as separated AST graph structures, for which we propose a hierarchical attention-based ConvGNN component to augment the Seq2Seq network. We build a dataset with publicly available Kaggle notebooks and evaluate our model (HAConvGNN) against baseline models (e.g., Code2Seq or Graph2Seq).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2018

Automatic Graphics Program Generation using Attention-Based Hierarchical Decoder

Recent progress on deep learning has made it possible to automatically t...
research
06/21/2021

A Mocktail of Source Code Representations

Efficient representation of source code is essential for various softwar...
research
04/06/2020

Improved Code Summarization via a Graph Neural Network

Automatic source code summarization is the task of generating natural la...
research
11/16/2018

PaccMann: Prediction of anticancer compound sensitivity with multi-modal attention-based neural networks

We present a novel approach for the prediction of anticancer compound se...
research
02/13/2016

Attention-Based Convolutional Neural Network for Machine Comprehension

Understanding open-domain text is one of the primary challenges in natur...
research
11/23/2022

Explainable AI for Pre-Trained Code Models: What Do They Learn? When They Do Not Work?

In recent years, there has been a wide interest in designing deep neural...
research
08/23/2021

ComSum: Commit Messages Summarization and Meaning Preservation

We present ComSum, a data set of 7 million commit messages for text summ...

Please sign up or login with your details

Forgot password? Click here to reset