[NLP] Introduction to NLP

2023. 3. 21. 10:38
๐Ÿง‘๐Ÿป‍๐Ÿ’ป ์ฃผ์š” ์ •๋ฆฌ
 
NLP
The history of NLP
The field of NLP
Ambiguous
Sparsity
Variation

 

 

๋ฐฐ๊ฒฝ ์ง€์‹

NLP๋ž€ ๋ฌด์—‡์ผ๊นŒ์š”?

 

https://google.com

 

๊ตฌ๊ธ€์— ๊ฒ€์ƒ‰ํ•ด๋ณด๋ฉด ์œ„์™€ ๊ฐ™์€ ๊ฒฐ๊ณผ๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

 

์˜์–ด๋กœ ๋ณด์ž๋ฉด,

 

NLP(Natural Language Processing) is a field of computer science and artificial intelligence concerned with enabling computers to understand, interpret, and generate human language.

 

์ด๋ ‡์Šต๋‹ˆ๋‹ค.

 

NLP์˜ ๋ชฉํ‘œ?

 

๋ชฉํ‘œํ•˜๋Š” ๋ฐ”๋Š”, ๋ถ„์„๊ฐ€๋Šฅํ•˜๊ณ , ์ดํ•ด๊ฐ€๋Šฅํ•˜๋ฉฐ, human language๋ฅผ ๊ตฌํ˜„ํ•  ์ˆ˜ ์žˆ๋Š” ์‹œ์Šคํ…œ์„ ๋งŒ๋“œ๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค.

 

NLP์˜ ์—ญ์‚ฌ?

 

์œ„์™€ ๊ฐ™์€ ์—ญ์‚ฌ๋ฅผ ๊ฐ€์ง€๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

 

์—ฌ๊ธฐ์„œ Statistical models์ด ์žˆ์Šต๋‹ˆ๋‹ค.

 

์ด๋Š” ํ™•๋ฅ ์— ๊ธฐ๋ฐ˜ํ•œ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.

 

 

 

๊ทธ๋ฆฌ๊ณ , ์ด๊ณณ์—์„œ ์šฐ๋ฆฌ๋Š” ์ตœ๊ทผ์˜ ์—ญ์‚ฌ๋ฅผ ์‚ดํŽด๋ณผ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

 

ํ˜„์žฌ OpenAI์—์„œ ๊ฐœ๋ฐœํ•œ ChatGPT๋ผ๋Š” chat bot์ด ์ง€๊ธˆ์—์„œ์•ผ ๊ฐ๊ด‘๋ฐ›๋Š” ์ด์œ ๋Š” ๋ฌด์—‡์ผ๊นŒ์š”?

 

์‚ฌ์‹ค CNN, RNN์˜ ์ด๋ก ๋“ค์€ ์˜ˆ์ „๋ถ€ํ„ฐ ์กด์žฌํ–ˆ์Šต๋‹ˆ๋‹ค.

 

๊ทธ๋Ÿฌ๋‚˜ ์ตœ๊ทผ 2013๋…„๋ถ€ํ„ฐ GPU๋ฅผ ์ด์šฉํ•œ ๋ณ‘๋ ฌ ์—ฐ์‚ฐ์ด ๊ฐ€๋Šฅํ•ด์ ธ, ์ด์ „์— ์ด๋ก ์ƒ CPU๋กœ๋Š” ์—ฐ์‚ฐํ•  ์ˆ˜ ์—†์—ˆ๋˜ ๊ฒƒ๋“ค์ด ๊ฐ€๋Šฅํ•ด์กŒ์Šต๋‹ˆ๋‹ค.

 

๊ทธ๋ž˜์„œ ์ง€๊ธˆ์—์„œ์•ผ generative mode์ธ Seq2Seq model์ด๋‚˜, Attention, Pretrained Models, GPT-4์™€ ๊ฐ™์€ ๊ฒƒ๋“ค์ด ๊ฐ•๋ ฅํ•œ ํž˜์„ ๊ฐ–๊ฒŒ ๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

 

 

NLP์˜ ์‚ฌ์šฉ?

 

๋ถ„์•ผ๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

 

  • Sentiment / Emotion analysis
  • Machine translation
  • Vision and language
  • Chatbot, Conversational AI
  • Question answering
  • Text Summarization
  • Code generation
  • Story generation

 

 

์œ„ ๋ถ„์•ผ๋“ค์€, NLP ๋ถ„์•ผ์—์„œ ์ž์ฃผ ์‚ฌ์šฉ๋˜๋ฉฐ ์—ฐ๊ตฌ ์ค‘์ž…๋‹ˆ๋‹ค.

 

์œ„ ๋ถ„์•ผ๋Š” Classification NLP์™€ Generation NLP๋กœ ๋‚˜๋ˆŒ ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

 

๋ฌธ์žฅ์„ ์ƒ์„ฑํ•˜๋Š” Seq2Seq ๊ฐ™์€ ๊ฒƒ์ด Generation model์ด๊ณ , Classification model์€ ์–ด๋–ค data์—์„œ ์›ํ•˜๋Š” ๊ฐ’์„ ๊ฐ€์ ธ์˜ค๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค.

 

 

NLP๋ฅผ ์ด์šฉํ•œ APP?

 

์œ„ ๋ถ„์•ผ์— ๋Œ€ํ•œ ์—ฌ๋Ÿฌ ๊ฐ€์ง€ Application์€ ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

 

 

  • DeepMoji
  • Google Neural Machine Translation
  • SQuAD
  • Visual Question Answering
  • Microsoft DialoGPT
  • Google Meena
  • Google PEGASUS
  • Facebook AI Research TransCoder
  • Github Copilot
  • OpenAI GPT-4
  • OpenAI Jukebox
  • Protein folding problem
  • OpenAI DALL-E2
  • Google IMAGEN

 

 

 

NLP์˜ ์–ด๋ ค์›€?

 

NLP์—๋Š” ์–ด๋– ํ•œ ์–ด๋ ค์›€์ด ์กด์žฌํ• ๊นŒ์š”?

 

ํ•˜๋‚˜์”ฉ ์‚ดํŽด๋ด…์‹œ๋‹ค.

 

 

Ambiguous

 

Words have many meanings.

 

๋‹จ์–ด์—๋Š” ๋งŽ์€ ๋œป์ด ์žˆ์Šต๋‹ˆ๋‹ค.

 

 

 

์šฐ๋ฆฌ๊ฐ€ ๋จน๋Š” ๋ฐค๋„ ์กด์žฌํ•˜๊ณ ,

 

 

 

 

 

 

 

 

 

๋ฐคํ•˜๋Š˜์„ ๋œปํ•  ๋•Œ ์“ฐ๋Š” ๋ฐค๋„ ์กด์žฌํ•ฉ๋‹ˆ๋‹ค.

 

 

 

 

 

 

 

 

 

 

์ด๋Ÿฌํ•œ ์˜๋ฏธ์˜ ๋ชจํ˜ธ์„ฑ ์†์—์„œ NLP๋Š” ์–ด๋–ป๊ฒŒ ๋ฐœ์ „ํ•  ์ˆ˜ ์žˆ์„๊นŒ์š”?

 

 

 

 

 

Sparsity

 

 

์šฐ๋ฆฌ๋Š” ๋ฌธ์žฅ์„ ๊ตฌ์„ฑํ•  ๋•Œ,

 

๊ฐ€์žฅ ๋งŽ์ด ์“ฐ๋Š” ๋‹จ์–ด๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.

 

์ด๋ฅผ ํ…Œ๋ฉด, "์€,๋Š”,์ด,๊ฐ€", ๋“ฑ๋“ฑ..

 

์˜์–ด์—์„œ๋Š” "is", "the", "a" , etc..

 

๊ทธ๋ ‡๋‹ค๋ฉด ํ†ต๊ณ„ํ•™์ ์œผ๋กœ ๋‹ค์Œ ๋‹จ์–ด๋ฅผ ์˜ˆ์ธกํ•จ์— ์žˆ์–ด ์šฐ๋ฆฌ๋Š” ์œ„์™€ ๊ฐ™์€ ๋‹จ์–ด๋“ค์„ ๊ณ ๋ฅผ ์ˆ˜ ๋ฐ–์— ์—†์Šต๋‹ˆ๋‹ค.

 

์ด๋Ÿฌํ•œ Sparsity๋ฅผ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด, NLP์—์„œ๋Š” ์–ด๋–ค ๋ฐฉ์‹์„ ์‚ฌ์šฉํ–ˆ์„๊นŒ์š”?

 

 

 

Variation

 

์šฐ๋ฆฌ๋Š” ์ƒ๊ฐ์„ ๊ฑฐ์ณ์„œ ๋ง์„ ํ•ฉ๋‹ˆ๋‹ค.

 

๊ทธ๋Ÿฌ๋‚˜, ์–ด๋–ค ๊ฒƒ์ด ๋” ์ข‹์€ ๋ฌธ์žฅ์ผ๊นŒ๋ฅผ ํ•ญ์ƒ ์ƒ๊ฐํ•˜์ง€๋Š” ์•Š์ฃ .

 

๊ฒฉ์‹์„ ์ฐจ๋ ค์•ผ ํ•˜๋Š” ์ž๋ฆฌ์—์„œ ์šฐ๋ฆฌ๋Š” ์–ด๋–ค ๊ฒƒ์ด ๋” ์ž˜ ์“ด ๋ฌธ์žฅ์ผ์ง€๋ฅผ ์ƒ๊ฐํ•ฉ๋‹ˆ๋‹ค.

 

ํ˜น์€ ๋…ผ๋ฌธ์„ ์“ธ ๋•Œ๋„ ๊ทธ๋ ‡์ฃ .

 

๊ทธ๋ ‡๋‹ค๋ฉด, NLP์—์„œ๋Š” ์–ด๋–ป๊ฒŒ ์ด๋ฅผ ํ•ด๊ฒฐํ• ๊นŒ์š”?

 

 

 

 

 

 

 

 

๊ณ„์†ํ•ด์„œ ์•Œ์•„๋ด…์‹œ๋‹ค.

 

 

'Artificial Intelligence > Natural Language Processing' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

[NLP] Word Embedding - Word2Vec  (0) 2023.03.27
[NLP] Word Embedding - Skip Gram  (0) 2023.03.27
[NLP] Word Embedding - CBOW  (1) 2023.03.27
[NLP] Introduction to Word Embedding  (0) 2023.03.26
[NLP] Overview NLP  (0) 2023.03.21

BELATED ARTICLES

more