I am Seokweon Jang, CEO of the solo AI startup Deep Network.

 

GPT-3 LLM AI one-person startup Deep Network / sayhi7@daum.net

 

Even when GPT-3 was announced in June 2020, you knew that the fundamental requirement for building an LLM is securing a large amount of training data, right? In the case of GPT-3, roughly 90% of its training corpus of about 500B tokens was collected and processed through web crawling, so significant know-how in web backend design is also important when developing a GPT-3-class foundation model.

My own focus has been on Korean inference based on the GPT-3 model, in particular tokenizing and embedding Korean at the morpheme level, and I went through some hardships before I secured that know-how. I now understand the key procedures and methods for morpheme-level tokenizing and embedding needed to apply Korean to the GPT-3 model (an illustrative sketch of one such pipeline appears at the end of this post).

Because I lack web crawling skills, my plan for RAG search was not general crawling but a limited search function targeting specific sites such as the arXiv paper site: I intended to use the API that arXiv provides to obtain the metadata of the PDF papers needed for RAG search (see the arXiv API sketch below).

I know that a major Korean company has been developing a document parser based on deep-learning OCR models for nearly 10 years. I intend to parse PDF documents directly. Several open-source libraries exist for parsing PDF documents, and I also hold key information of my own on PDF parsing. Global companies are likewise focusing on specific inference-side issues as part of LLM commercialization, and I believe the core commercialization issue is PDF document parsing, whose key issues I understand (a basic extraction sketch follows below).

I also understand the core implementation techniques for structuring multiple tasks over specific datasets, so that a single GPT-3-style model can perform learning and inference across tens of benchmark datasets (a toy multitask-mixing sketch is included below).

There is much more I could say, but I will stop here. The details are confidential to my solo AI startup Deep Network and cannot be disclosed.
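For readers unfamiliar with morpheme-level Korean tokenization, here is a minimal sketch of one publicly known approach, not Deep Network's actual (confidential) pipeline: pre-segment Korean text with a morpheme analyzer, then train a BPE subword vocabulary on the segmented corpus so the embedding layer sees morpheme-aligned units. The KoNLPy Okt analyzer, SentencePiece, the corpus file name, and the vocabulary size are all my own illustrative assumptions.

```python
# Minimal sketch: morpheme segmentation -> BPE subword vocabulary for a GPT-3-style model.
# Assumptions: konlpy and sentencepiece are installed; "korean_corpus.txt" is a
# hypothetical raw-text corpus with one sentence per line.
from konlpy.tag import Okt
import sentencepiece as spm

okt = Okt()

def segment_morphemes(in_path: str, out_path: str) -> None:
    """Write a whitespace-joined morpheme sequence for every input line."""
    with open(in_path, encoding="utf-8") as fin, open(out_path, "w", encoding="utf-8") as fout:
        for line in fin:
            fout.write(" ".join(okt.morphs(line.strip())) + "\n")

segment_morphemes("korean_corpus.txt", "korean_corpus.morphs.txt")

# Train a BPE subword model on the morpheme-segmented text; vocab_size is illustrative.
spm.SentencePieceTrainer.train(
    input="korean_corpus.morphs.txt",
    model_prefix="ko_gpt_bpe",
    vocab_size=32000,
    model_type="bpe",
)

# Encode a morpheme-segmented sentence into ids for the GPT-3-style embedding layer.
sp = spm.SentencePieceProcessor(model_file="ko_gpt_bpe.model")
sample = " ".join(okt.morphs("딥네트워크는 1인 AI 스타트업입니다."))
print(sp.encode(sample, out_type=int))
```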
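The "limited search instead of crawling" idea can be illustrated with the public arXiv API (http://export.arxiv.org/api/query), which returns paper metadata as an Atom feed. The query string, result count, and returned fields below are illustrative assumptions, not my production implementation.

```python
# Minimal sketch: fetch arXiv paper metadata (title, abstract, PDF link) via the
# public arXiv API, using only the Python standard library.
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

ATOM = "{http://www.w3.org/2005/Atom}"

def search_arxiv(query: str, max_results: int = 5) -> list[dict]:
    """Return title / abstract / PDF-link metadata for papers matching the query."""
    params = urllib.parse.urlencode({
        "search_query": f"all:{query}",
        "start": 0,
        "max_results": max_results,
    })
    with urllib.request.urlopen(f"http://export.arxiv.org/api/query?{params}") as resp:
        feed = ET.fromstring(resp.read())

    papers = []
    for entry in feed.findall(f"{ATOM}entry"):
        pdf_url = next(
            (link.get("href") for link in entry.findall(f"{ATOM}link")
             if link.get("title") == "pdf"),
            None,
        )
        papers.append({
            "title": entry.findtext(f"{ATOM}title", "").strip(),
            "abstract": entry.findtext(f"{ATOM}summary", "").strip(),
            "pdf_url": pdf_url,
        })
    return papers

for paper in search_arxiv("retrieval augmented generation"):
    print(paper["title"], "->", paper["pdf_url"])
```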
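As an example of the open-source PDF libraries mentioned above, the sketch below uses pdfminer.six to extract plain text from a downloaded paper and split it into rough chunks that a RAG index could ingest. The file name and the simple chunking rule are illustrative assumptions; they are not my confidential parsing method.

```python
# Minimal sketch: extract text from a PDF with pdfminer.six and produce
# paragraph-like chunks for downstream RAG indexing.
from pdfminer.high_level import extract_text

def pdf_to_chunks(pdf_path: str, min_chars: int = 200) -> list[str]:
    """Extract text and return chunks of at least min_chars characters."""
    raw = extract_text(pdf_path)
    chunks, current = [], []
    for para in raw.split("\n\n"):
        current.append(para.strip())
        if sum(len(p) for p in current) >= min_chars:
            chunks.append("\n".join(current))
            current = []
    if current:
        chunks.append("\n".join(current))
    return chunks

# "2005.14165.pdf" (the GPT-3 paper) is used here purely as an example file name.
for i, chunk in enumerate(pdf_to_chunks("2005.14165.pdf")[:3]):
    print(f"--- chunk {i} ---")
    print(chunk[:200])
```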
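Finally, one commonly published way to handle tens of benchmark datasets with a single decoder-only model, offered here only as an assumption and not as Deep Network's design, is to render every example as one text sequence with a task prefix and to sample tasks proportionally to dataset size. The toy datasets and prompt format below are hypothetical.

```python
# Minimal sketch: task-prefixed text formatting plus proportional task sampling
# for multitask training/evaluation of a GPT-3-style language model.
import random
from typing import Iterator

# Hypothetical toy datasets; real ones would be loaded from benchmark files.
DATASETS = {
    "nli": [("premise: A man eats. hypothesis: A person eats.", "entailment")],
    "sentiment": [("review: The movie was wonderful.", "positive")],
    "qa": [("question: When was GPT-3 announced? context: ...", "2020")],
}

def format_example(task: str, source: str, target: str) -> str:
    """Cast any task into a single text sequence the language model can be trained on."""
    return f"[{task}] {source} answer: {target}"

def multitask_stream(seed: int = 0) -> Iterator[str]:
    """Yield formatted examples, sampling tasks proportionally to their size."""
    rng = random.Random(seed)
    tasks = list(DATASETS)
    weights = [len(DATASETS[t]) for t in tasks]
    while True:
        task = rng.choices(tasks, weights=weights, k=1)[0]
        source, target = rng.choice(DATASETS[task])
        yield format_example(task, source, target)

stream = multitask_stream()
for _ in range(3):
    print(next(stream))
```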
