I am Seokwon Jang, the CEO of DeepNetwork, a one-person AI startup.

 

Over the past three years, I have dedicated myself to securing the foundational technology for building GPT-3 models.

I have been analyzing the detailed design and principles of GPT-3's architecture for more than three years. Initially, I struggled to understand why the GPT-3 model was designed in such a way. Specifically, I couldn't grasp how the large language model (LLM) functions with only the decoder part of the transformer model, while the encoder part is omitted.

Now, after three years of detailed analysis, I know GPT-3 inside and out. Although I haven't been able to conduct practical experiments due to the lack of deep learning server infrastructure, I have achieved an expert-level understanding of the TensorFlow implementation of GPT-3, and I am capable of working on its development at a professional level.

I thoroughly understand the structural design of GPT-3, why it is built the way it is, and how each part processes and operates. Initially, I thought that understanding GPT-3's architecture would be enough, but as I dug deeper, I realized that the processing of tokenization and embedding, especially for Korean and English, is the core of its functionality. It took me months to fully understand this critical aspect.

When I analyze a system, I focus on breaking down its principles—its algorithms, design structures, and operational mechanisms. In particular, I have invested a significant amount of time and effort into analyzing and understanding Korean tokenization and embedding processes. This was a challenging task, but ultimately, I succeeded in mastering it.

Based on this extensive effort to secure the foundational technology of GPT-3 models, my one-person AI startup, DeepNetwork, is now ready to pursue commercialization of this expertise.

 

One_person AI Startup DeepNetwork CEO /  SeokWeon Jang  /  sayhi7@daum.net

 

 

+ Recent posts