banner

Visualization for a new era: Impact and application of large language models and AIGC to traditional business models

Qianqian Yang, Ngai Cheong, Dejiang Wang, Shi Li, Oi Neng Lei

Abstract


This paper focuses on the application and business value of large-scale language models, such as GPT and Ernie’s model. These models combined with AIGC tools like stable diffusion generate images with fixed styles, character traits, and continuous plots using randomized story scripts. As a result, it enhances the operational efficiency between or within industries widely, and it fully demonstrate their business value. On the technical side, this paper describes in detail of building a pipeline to generate cue words required for stable diffusion, in which using large-scale language models and story scripts. Subsequently, the limitations of text-to-image are summarized by comparing the traditional method and language model, i.e. comparing characteristics from traditional book production and images generated using language model’s cue words. This leads to a supervised multiround iterative LoRA modeling scheme that utilizes CLIP to achieve character IP fixation. To evaluate the impact of the application direction, we combine application scenarios and researches on application aspects regarding current AIGC industry structure, we found that the AIGC tool has several major aspects, mainly includes the aspects of basic big model, industry and scenario models, business and domain small models, AI infrastructure and AIGC supporting services. big model and AIGC techniques generate images with no specific rules and have less limitation. We call this ‘visualization’ in the new AI era. In this paper, we explore the possible impacts and economic values when changing from traditional domain to the new AI ear.


Keywords


large-scale language models; AIGC tools; image generation; operational efficiency; conversion of text into customized pictures; visualization in the new AI era; application scenarios; LoRA modeling scheme

Full Text:

PDF

References


1. Zarifhonarvar A. Economics of ChatGPT: a labor market view on the occupational impact of artificial intelligence. Journal of Electronic Business & Digital Economics. Published online December 5, 2023. doi: 10.1108/jebde-10-2023-0021

2. Qi Z, Yu Y, Tu M, et al. FoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph Prompt. ArXiv. 2023, arXiv:2308.10173. doi: 10.48550/arXiv.2308.10173

3. Kasneci E, Sessler K, Küchemann S, et al. ChatGPT for good? On opportunities and challenges of large language models for education. Learning and Individual Differences. 2023; 103: 102274. doi: 10.1016/j.lindif.2023.102274

4. Liu X, Zhu Z, Liu H, et al. WavJourney: Compositional Audio Creation with Large Language Models. ArXiv. 2023, arXiv:2307.14335. doi: 10.48550/arXiv.2307.14335

5. Chen J, Liu Z, Huang X. When Large Language Models Meet Personalization: Perspectives of Challenges and Opportunities. ArXiv. 2023, arXiv:2307.16376. doi: 10.48550/arXiv.2307.16376

6. Li Y. Intelligent Environmental Art Design Combining Big Data and Artificial Intelligence. Complexity. 2021; 2021: 1-11. doi: 10.1155/2021/1606262

7. Törnberg P. How to use LLMs for Text Analysis. ArXiv. 2023, arXiv:2307.13106. doi: 10.48550/arXiv.2307.13106

8. Jin Z, Song Z. Generating coherent comic with rich story using ChatGPT and Stable Diffusion. ArXiv. 2023, arXiv:2305.11067. doi: 10.48550/arXiv.2307.11067

9. Hu EJ, Shen Y, Wallis P, et al. LoRA: Low-Rank Adaptation of Large Language Models. ArXiv. 2021, arXiv:2106.09685. doi: 10.48550/arXiv.2106.09685

10. Qi Z, Yu Y, Tu M, et al. FoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph Prompt. ArXiv. 2023, arXiv:2308.10173. doi: 10.48550/arXiv.2308.10173

11. Liu X, Zhu Z, Liu H, et al. WavJourney: Compositional Audio Creation with Large Language Models. ArXiv. 2023, arXiv:2307.14335. doi: 10.48550/arXiv.2307.14335

12. Lai T, Xie C, Ruan M, Wang Z, Lu H, Fu S. Influence of artificial intelligence in education on adolescents’ social adaptability: The mediatory role of social support. Uddin MZ, ed. PLOS ONE. 2023; 18(3): e0283170. doi: 10.1371/journal.pone.0283170

13. Przybilla L, Klinker K, Lang M, Schreieck M, Wiesche M, Krcmar H. Design Thinking in Digital Innovation Projects—Exploring the Effects of Intangibility. IEEE Transactions on Engineering Management. 2022; 69(4): 1635-1649. doi: 10.1109/tem.2020.3036818

14. Zhang C, Zhang C, Zhang C, et al. A Complete Survey on Generative AI (AIGC): Is ChatGPT from GPT-4 to GPT-5 All You Need? ArXiv. 2023, arXiv:2303.11717. doi: 10.48550/arXiv.2303.11717

15. Available online: http://news.10jqka.com.cn/field/sr/20230830/44188511.shtml (accessed on 2 June 2023).

16. Lecler A, Duron L, Soyer P. Revolutionizing radiology with GPT-based models: Current applications, future possibilities and limitations of ChatGPT. Diagn Interv Imaging. 2023 Jun;104(6):269-274. doi: 10.1016/j.diii.2023.02.003

17. Available online: http://news.10jqka.com.cn/field/sr/20230607/43107125.shtml (accessed on 2 June 2023).

18. Zhang C, Zhang C, Li C. One Small Step for Generative AI, One Giant Leap for AGI: A Complete Survey on ChatGPT in AIGC Era. ArXiv. 2023, arXiv:2304.06488. doi: 10.48550/arXiv.2304.06488




DOI: https://doi.org/10.32629/jai.v7i4.1487

Refbacks

  • There are currently no refbacks.


Copyright (c) 2024 Qianqian Yang, Ngai Cheong, Dejiang Wang, Shi Li, Oi Neng Lei

License URL: https://creativecommons.org/licenses/by-nc/4.0/