A Bytecode-based Approach for Smart Contract Classification

05/31/2021
by   Chaochen Shi, et al.
0

With the development of blockchain technologies, the number of smart contracts deployed on blockchain platforms is growing exponentially, which makes it difficult for users to find desired services by manual screening. The automatic classification of smart contracts can provide blockchain users with keyword-based contract searching and helps to manage smart contracts effectively. Current research on smart contract classification focuses on Natural Language Processing (NLP) solutions which are based on contract source code. However, more than 94 application scenarios of NLP methods are very limited. Meanwhile, NLP models are vulnerable to adversarial attacks. This paper proposes a classification model based on features from contract bytecode instead of source code to solve these problems. We also use feature selection and ensemble learning to optimize the model. Our experimental studies on over 3,300 real-world Ethereum smart contracts show that our model can classify smart contracts without source code and has better performance than baseline models. Our model also has good resistance to adversarial attacks compared with NLP-based models. In addition, our analysis reveals that account features used in many smart contract classification models have little effect on classification and can be excluded.

READ FULL TEXT
research
07/19/2020

STAN: Towards Describing Bytecodes of Smart Contract

More than eight million smart contracts have been deployed into Ethereum...
research
06/02/2023

SourceP: Smart Ponzi Schemes Detection on Ethereum Using Pre-training Model with Data Flow

As blockchain technology becomes more and more popular, a typical financ...
research
06/11/2021

SolcTrans: Towards machine translation of Solidity smart contract source code

Context: Decentralized applications on blockchain platforms are realized...
research
06/07/2023

Vulnerable Smart Contract Function Locating Based on Multi-Relational Nested Graph Convolutional Network

The immutable and trustable characteristics of blockchain enable smart c...
research
10/06/2022

Single-Use Delegatable Signatures Based on Smart Contracts

Delegation of cryptographic signing rights has found many application in...
research
11/28/2021

Semantic Code Search for Smart Contracts

Semantic code search technology allows searching for existing code snipp...
research
10/05/2021

ContractNLI: A Dataset for Document-level Natural Language Inference for Contracts

Reviewing contracts is a time-consuming procedure that incurs large expe...

Please sign up or login with your details

Forgot password? Click here to reset