NetGPT: A Native-AI Network Architecture Beyond Provisioning Personalized Generative Services

07/12/2023
by   Yuxuan Chen, et al.
0

Large language models (LLMs) have triggered tremendous success to empower daily life by generative information, and the personalization of LLMs could further contribute to their applications due to better alignment with human intents. Towards personalized generative services, a collaborative cloud-edge methodology sounds promising, as it facilitates the effective orchestration of heterogeneous distributed communication and computing resources. In this article, after discussing the pros and cons of several candidate cloud-edge collaboration techniques, we put forward NetGPT to capably deploy appropriate LLMs at the edge and the cloud in accordance with their computing capacity. In addition, edge LLMs could efficiently leverage location-based information for personalized prompt completion, thus benefiting the interaction with cloud LLMs. After deploying representative open-source LLMs (e.g., GPT-2-base and LLaMA model) at the edge and the cloud, we present the feasibility of NetGPT on the basis of low-rank adaptation-based light-weight fine-tuning. Subsequently, we highlight substantial essential changes required for a native artificial intelligence (AI) network architecture towards NetGPT, with special emphasis on deeper integration of communications and computing resources and careful calibration of logical AI workflow. Furthermore, we demonstrate several by-product benefits of NetGPT, given edge LLM's astonishing capability to predict trends and infer intents, which possibly leads to a unified solution for intelligent network management & orchestration. In a nutshell, we argue that NetGPT is a promising native-AI network architecture beyond provisioning personalized generative services.

READ FULL TEXT

page 1

page 5

research
10/01/2020

Towards Self-learning Edge Intelligence in 6G

Edge intelligence, also called edge-native artificial intelligence (AI),...
research
06/02/2023

An Overview on Generative AI at Scale with Edge-Cloud Computing

As a specific category of artificial intelligence (AI), generative artif...
research
05/18/2021

AI-Native Network Slicing for 6G Networks

With the global roll-out of the fifth generation (5G) networks, it is ne...
research
03/04/2021

Toward Native Artificial Intelligence in 6G Networks: System Design, Architectures, and Paradigms

The mobile communication system has transformed to be the fundamental in...
research
03/08/2023

KubeEdge-Sedna v0.3: Towards Next-Generation Automatically Customized AI Engineering Scheme

The scale of the global edge AI market continues to grow. The current te...
research
05/04/2023

DECICE: Device-Edge-Cloud Intelligent Collaboration Framework

DECICE is a Horizon Europe project that is developing an AI-enabled open...
research
10/07/2022

In-situ Model Downloading to Realize Versatile Edge AI in 6G Mobile Networks

The sixth-generation (6G) mobile networks are expected to feature the ub...

Please sign up or login with your details

Forgot password? Click here to reset