Neural Networks์˜ ์—ญ์‚ฌ

Neural Networks์˜ ์—ญ์‚ฌ

ย 

NN์˜ ์—ญ์‚ฌ


Mark I Perceptron machine(1957)

ย 
  • Frank Rosenblatt๊ฐ€ ๊ฐœ๋ฐœํ•œ ํผ์…‰ํŠธ๋ก  ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ๊ตฌํ˜„ํ•œ ์ตœ์ดˆ์˜ ๊ธฐ๊ณ„๋กœ W(๊ฐ€์ค‘์น˜)x + b(ํŽธํ–ฅ) ์›๋ฆฌ์— ์˜ํ•ด ์ž‘๋™ํ•œ๋‹ค. ์ถœ๋ ฅ ๊ฐ’์€ 1 ๋˜๋Š” 0์ด๋‹ค. Update ๊ทœ์น™์ด ์กด์žฌํ•ด ์—ญ์ „ํŒŒ์™€ ์œ ์‚ฌํ•ด๋ณด์ด์ง€๋งŒ ๊ฐ€์ค‘์น˜๊ฐ’์„ ์ง์ ‘ ์กฐ์ •ํ•˜๋Š” ๊ฒƒ์— ๊ทธ์ณ ์—ญ์ „ํŒŒ๋ฅผ ๊ตฌํ˜„ํ–ˆ๋‹ค๊ณ  ๋ณด๊ธฐ์—๋Š” ์–ด๋ ค์›€์ด ์žˆ๋‹ค.
notion image

Adaline and Madaline(1960)

  • Widrow์™€ Hoff๊ฐ€ ๊ฐœ๋ฐœํ•œ ๋‰ด๋Ÿด๋„ท์œผ๋กœ, ์ตœ์ดˆ๋กœ ์„ ํ˜•๋ ˆ์ด์–ด๋ฅผ ๋‹ค์ธต ํผ์…‰ํŠธ๋ก  ๋„คํŠธ์›Œํฌ์— ์Œ“๊ธฐ ์‹œ์ž‘ํ–ˆ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ์•„์ง ์—ญ์ „ํŒŒ์˜ ์›๋ฆฌ๊ฐ€ ์ ์šฉ๋˜์ง€ ์•Š์•˜๊ณ , ๋”ฐ๋กœ ํ›ˆ๋ จ์‹œํ‚ฌ ๋ฐฉ๋ฒ•์ด ์กด์žฌํ•˜์ง€ ์•Š์•˜๋‹ค.
ย 

์ตœ์ดˆ์˜ Backporp (1986)

  • Rumelhart ๊ฐœ๋ฐœํ–ˆ์œผ๋ฉฐ, ์—ญ์ „ํŒŒ๋ฅผ ๊ตฌํ˜„ํ•จ์œผ๋กœ์„œ ์ตœ์ดˆ๋กœ network๋ฅผ ํ•™์Šต์‹œํ‚ค๋Š” ๊ฐœ๋… ์ •๋ฆฝํ–ˆ๋‹ค. ์ด๋กœ ์ธํ•ด Chain rule๊ณผ Update rule์ด ๋“ฑ์žฅํ•  ์ˆ˜ ์žˆ์—ˆ๋‹ค.
ย 

Geoff Hinton ๊ณผ Ruslan Salakhutdinov์˜ ๋…ผ๋ฌธ (2006)

  • DNN(์‹ฌ์ธต ์‹ ๊ฒฝ๋ง)์˜ ํ•™์Šต๊ฐ€๋Šฅ์„ฑ์ด ์ตœ์ดˆ๋กœ ๋‚˜์˜จ ๋…ผ๋ฌธ์ด๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ ์„ธ์‹ฌํ•œ ์ดˆ๊ธฐํ™”๊ฐ€ ํ•„์š”ํ•˜๋ฉฐ, ์ดˆ๊ธฐํ™”๋ฅผ ์œ„ํ•ด RBM(์ œํ•œ ๋ณผ์ธ ๋งŒ ๋จธ์‹ )์„ ์ด์šฉํ•ด์„œ ๊ฐ ํžˆ๋“ ๋ ˆ์ด์–ด ๊ฐ€์ค‘์น˜ ํ•™์Šต ํ•„์š”ํ•˜๋‹ค๋Š” ํ•œ๊ณ„๊ฐ€ ์žˆ๋‹ค.

NN ์—ดํ’ (2012)

  • Hintin lab์˜ Acoustic Modeling๊ณผ Speech Recognition ์—ฐ๊ตฌ๋“ค์ด ๋‰ด๋Ÿด๋„ท ์—ดํ’์˜ ์‹œ์ž‘์ด์—ˆ๋‹ค. ํŠนํžˆ Alex Krizhevsky์˜ ์˜์ƒ ์ธ์‹ ๊ด€๋ จ Imagenet ๋ถ„๋ฅ˜์—์„œ ์ตœ์ดˆ๋กœ ๋‰ด๋Ÿด๋„ท์„ ์‚ฌ์šฉํ•œ ์—ฐ๊ตฌ๊ฐ€ ๋“ฑ์žฅํ•˜์˜€๋‹ค.
ย