Data Preprocessing

Data Preprocessing


ย 
notion image
ย 

zero-centering

  • ๋ฐ์ดํ„ฐ์˜ ๋ชจ๋“  feature๋งˆ๋‹ค ํ‰๊ท ์œผ๋กœ ๋‚˜๋ˆ„๊ณ , ์œ„์˜ ๊ทธ๋ฆผ์ฒ˜๋Ÿผ zero-centered ๋ฐ์ดํ„ฐ๋กœ ๋งŒ๋“œ๋Š” ๊ณผ์ •์ด๋‹ค.
  • ์ „์ฒ˜๋ฆฌ ๊ณผ์ •์€ ์ „์ฒด ํ‰๊ท  ์ด๋ฏธ์ง€๋ฅผ ๋นผ๊ฑฐ๋‚˜, 3๊ฐœ ์ฑ„๋„(RGB)์˜ ๊ฐ๊ฐ ํ‰๊ท ์„ ๊ตฌํ•ด์„œ ๊ฐ ์ฑ„๋„๋ณ„๋กœ ํ‰๊ท ์„ ๋นผ์„œ ์ง„ํ–‰ํ•œ๋‹ค.
  • ์ด๋ฏธ์ง€์˜ ๊ฒฝ์šฐ ์ „์ฒ˜๋ฆฌ๋กœ zero-centering ์ •๋„๋งŒ ์‚ฌ์šฉํ•˜๊ณ  normalization ๊นŒ์ง€๋Š” ์‚ฌ์šฉ ํ•˜์ง€ ์•Š๋Š”๋‹ค. โ†’ ์ด๋ฏธ์ง€์˜ ๋ชจ๋“  ํ”ฝ์…€๋“ค์˜ scale์€ [0, 255]๋กœ ๊ฐ™๊ธฐ ๋•Œ๋ฌธ
  • ๋” ๋‚ฎ์€ ์ฐจ์›์œผ๋กœ ๊ฐ์†Œ์‹œํ‚ค๋Š” PCA๋‚˜ whitened data ์™€ ๊ฐ™์€ ๋ฐฉ๋ฒ•๋„ ์ด๋ฏธ์ง€ ์ฒ˜๋ฆฌ์—์„  ์‚ฌ์šฉํ•˜์ง€ ์•Š๋Š”๋‹ค. โ†’ CNN์—์„œ๋Š” ์›๋ณธ ์ด๋ฏธ์ง€ ์ž์ฒด์˜ spatial ์ •๋ณด๋ฅผ ์ด์šฉํ•ด์„œ ์ด๋ฏธ์ง€์˜ spatial structure๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ๋„๋ก ํ•œ๋‹ค.
  • Validation๊ณผ test ๋‹จ๊ณ„์—์„œ๋„ train set์˜ ํ‰๊ท , ํ‘œ์ค€ํŽธ์ฐจ๋ฅผ ์‚ฌ์šฉํ•˜์—ฌ ์ „์ฒ˜๋ฆฌํ•œ๋‹ค.
ย