728x90

AI 4

[RNN] 2. Vanialla RNN, LSTM(Long Short-Term Memory), GRU(Gated Recurrent Unit)

์ˆœ์ฐจ ๋ฐ์ดํ„ฐ๋ฅผ ์–ด๋–ป๊ฒŒ ์ˆœํ™˜ ์‹ ๊ฒฝ๋ง์„ ํ†ตํ•ด ์ฒ˜๋ฆฌํ•˜๋Š”์ง€ ์•Œ์•„๋ณด์ž. ๊ธฐ์–ต ์‹œ์Šคํ…œ ๋งŒ์•ฝ ์‹œ๋ฆฌ ๊ฐ™์€ ๊ฐœ์ธ ๋น„์„œ์˜ ๊ฒฝ์šฐ ์˜ฌ๋ฐ”๋ฅธ ๋Œ€๋‹ต์„ ํ•˜๋ ค๋ฉด ์ž…๋ ฅ์„ ๋ฐ›์„ ๋•Œ๋งˆ๋‹ค ๊ทธ ๋‚ด์šฉ์„ ๊ธฐ์–ตํ•ด์•ผ ํ•œ๋‹ค. ๋ฐ˜๋Œ€๋กœ ์ด์ „ ์ž…๋ ฅ์„ ๊ธฐ์–ตํ•˜์ง€ ์•Š๋Š” ์‹œ์Šคํ…œ์€ ๋ฌด๊ธฐ์–ต ์‹œ์Šคํ…œ์ด๋ผ ํ•œ๋‹ค. ์–•์€ ์‹ ๊ฒฝ๋ง(Shallow Neural Network)์ด ๋ฌด๊ธฐ์–ต ์‹œ์Šคํ…œ์˜ ์˜ˆ์‹œ n๋ฒˆ์งธ Time-Step์— ๋Œ€ํ•œ ๊ฒฐ๊ณผ๊ฐ€ ์ด์ „ ์ž…๋ ฅ์— ์˜ํ–ฅ ๋ฐ›์ง€ ์•Š๋Š”๋‹ค. ๊ธฐ๋ณธ์ ์ธ ์ˆœํ™˜ ์‹ ๊ฒฝ๋ง (Vanilla Recurrent Network) Vanilla RNN์˜ ๊ตฌ์กฐ๋Š” shallow NN ๊ตฌ์กฐ์— '์ˆœํ™˜(recurrent)'์ด ์ถ”๊ฐ€๋œ ๊ฒƒ์ด๋‹ค. ์ˆœํ™˜: n-1๋ฒˆ์งธ time step์ด n๋ฒˆ์งธ time step์œผ๋กœ ๋‹ค์‹œ ๋Œ์•„์˜ค๋Š” ๊ฒƒ ๊ธฐ์–ต ์‹œ์Šคํ…œ์ด๋ฏ€๋กœ RNN์˜ ์ถœ๋ ฅ์€ ์ด์ „์˜ ๋ชจ๋“  ์ž…๋ ฅ์— ์˜ํ–ฅ..

[๋…ผ๋ฌธ ๋ฆฌ๋ทฐ] Very Deep Convolutional Networks for Large-Scale Image Recognition ๋ฆฌ๋ทฐ, VGG Net

๋ฌด๋ ค 1๋…„์ „์— ์ •๋ฆฌํ•ด๋†“์€ ๋…ผ๋ฌธ ์˜ฌ๋ฆฌ๊ธฐ ใ…Žใ……ใ…Ž Image Recognition์— ์ž…๋ฌธํ•  ๋•Œ ์ข‹์€ ๋…ผ๋ฌธ์ด๋ผ๊ณ  ์ƒ๊ฐํ•œ๋‹ค. Very Deep Convolutional Networks for Large-Scale Image Recognition arxiv.org/abs/1409.1556 Very Deep Convolutional Networks for Large-Scale Image Recognition In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough eva..

[Machine Learning] ๋จธ์‹ ๋Ÿฌ๋‹ workflow, ๋ฐ์ดํ„ฐ์…‹ ๋ถ„๋ฆฌ

ahnty0122.tistory.com/58 [Deep Learning] ๋”ฅ๋Ÿฌ๋‹/๋จธ์‹ ๋Ÿฌ๋‹/์ธ๊ณต์ง€๋Šฅ์˜ ์ฐจ์ด, ์ธ๊ณต์‹ ๊ฒฝ๋ง ๊ฐœ๋… ๋”ฅ๋Ÿฌ๋‹์— ๋Œ€ํ•ด ์ดํ•ดํ•˜๋ ค๋ฉด ๋จผ์ € ์ธ๊ณต์ง€๋Šฅ, ๋จธ์‹ ๋Ÿฌ๋‹, ๋”ฅ๋Ÿฌ๋‹์˜ ์ฐจ์ด์ ์„ ์•Œ์•„์•ผํ•œ๋‹ค. ์ธ๊ณต์ง€๋Šฅ(AI): ์‚ฌ๋žŒ์˜ ์ง€๋Šฅ์„ ๋ชจ๋ฐฉํ•ด ์‚ฌ๋žŒ์ด ํ•˜๋Š” ๋ณต์žกํ•œ ์ผ์„ ํ•  ์ˆ˜ ์žˆ๋Š” ๊ธฐ๊ณ„๋ฅผ ๋งŒ๋“œ๋Š” ๊ฒƒ ๋จธ์‹ ๋Ÿฌ๋‹(ML): ์ž… ahnty0122.tistory.com Machine Learning Workflow ์œ„์˜ ๊ธ€์—์„œ ์ธ๊ณต์‹ ๊ฒฝ๋ง์„ ํ•™์Šตํ•˜๋Š” ๊ณผ์ •์— ๋Œ€ํ•ด ์ ์–ด๋†จ๋Š”๋ฐ, ์ด ์ธ๊ณต์‹ ๊ฒฝ๋ง์„ ํ•™์Šตํ•˜๋Š” ๋ฐฉ๋ฒ•์— ๋Œ€ํ•ด ์•Œ์•„๋ณด์ž. ๋ฏธ๋ฆฌ ์ •ํ•ด์ง„, ๋‹ต์ด ์žˆ๋Š” ํŠธ๋ ˆ์ด๋‹ ๋ฐ์ดํ„ฐ๋กœ ๋จธ์‹ ๋Ÿฌ๋‹ ์•Œ๊ณ ๋ฆฌ์ฆ˜์„ ํ•™์Šตํ•ด ๋ชจ๋ธ์„ ์–ป๋Š” training(ํ•™์Šต)๊ณผ์ •์„ ๊ฑฐ์ณ ์˜ฌ๋ฐ”๋ฅด๊ฒŒ ํ•™์Šต์ด ๋˜๋Š”์ง€ ์ž„์˜์˜ ์ž…๋ ฅ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๊ฒฐ๊ณผ๋ฅผ ํ™•์ธํ•˜๋Š” test ๊ณผ์ •์„ ..

[Deep Learning] ๋”ฅ๋Ÿฌ๋‹/๋จธ์‹ ๋Ÿฌ๋‹/์ธ๊ณต์ง€๋Šฅ์˜ ์ฐจ์ด, ์ธ๊ณต์‹ ๊ฒฝ๋ง ๊ฐœ๋…

๋”ฅ๋Ÿฌ๋‹์— ๋Œ€ํ•ด ์ดํ•ดํ•˜๋ ค๋ฉด ๋จผ์ € ์ธ๊ณต์ง€๋Šฅ, ๋จธ์‹ ๋Ÿฌ๋‹, ๋”ฅ๋Ÿฌ๋‹์˜ ์ฐจ์ด์ ์„ ์•Œ์•„์•ผํ•œ๋‹ค. ์ธ๊ณต์ง€๋Šฅ(AI): ์‚ฌ๋žŒ์˜ ์ง€๋Šฅ์„ ๋ชจ๋ฐฉํ•ด ์‚ฌ๋žŒ์ด ํ•˜๋Š” ๋ณต์žกํ•œ ์ผ์„ ํ•  ์ˆ˜ ์žˆ๋Š” ๊ธฐ๊ณ„๋ฅผ ๋งŒ๋“œ๋Š” ๊ฒƒ ๋จธ์‹ ๋Ÿฌ๋‹(ML): ์ž…๋ ฅ๋˜์–ด ์žˆ๋Š” ๋ฐ์ดํ„ฐ๋ฅผ ํ†ตํ•ด ๊ฐ๊ฐ์˜ ์ƒํ™ฉ์— ๋งž๋„๋ก ํ•™์Šต์‹œํ‚ค๋Š” ๊ฒƒ ๋”ฅ๋Ÿฌ๋‹(DL): ๋จธ์‹ ๋Ÿฌ๋‹์œผ๋กœ ํ•™์Šตํ•œ ๋ถ€๋ถ„๋“ค์„ ๊ธฐ๋ฐ˜์œผ๋กœ ์Šค์Šค๋กœ ํŒ๋‹จํ•ด ์ƒˆ๋กญ๊ฒŒ ๋Œ€์ฒ˜ํ•˜๋Š” ๊ฒƒ (=end-to-end machine learning) ๋”ฅ๋Ÿฌ๋‹์ด end-to-end machine learning์ธ ์ด์œ ๋Š” ์ฒ˜์Œ๋ถ€ํ„ฐ ๋๊นŒ์ง€ ์ปดํ“จํ„ฐ๊ฐ€ ํ•™์Šตํ•˜๋Š” ๊ฒƒ์œผ๋กœ, ๋จธ์‹ ๋Ÿฌ๋‹์ฒ˜๋Ÿผ ์‚ฌ๋žŒ์ด ๋ฐ์ดํ„ฐ์—์„œ ์ผ๋ถ€ ํŠน์ง•์„ ๋ฝ‘์•„์„œ ๊ธฐ๊ณ„์—๊ฒŒ ์ „๋‹ฌํ•ด ํ•™์Šต์‹œํ‚ค๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๋ผ ๋ฐ์ดํ„ฐ ์ž์ฒด๋ฅผ ์ปดํ“จํ„ฐ์— ์ „๋‹ฌํ•˜๋Š” ๋“ฑ ์‚ฌ๋žŒ์˜ ๊ฐœ์ž…์ด ์•„์˜ˆ ์—†๊ธฐ ๋•Œ๋ฌธ์ด๋‹ค. ์ธ๊ณต์ง€๋Šฅ, ๋จธ์‹ ๋Ÿฌ๋‹, ๋”ฅ๋Ÿฌ๋‹์˜ ์ •์˜๋ฅผ ..

728x90