📈 Data Science Using Python

Primary language: Jupyter Notebook

Data_Analysis

1-1. Nhn 뉴스제목추출 번역 API_웹툰이미지.ipynb (news headline extraction, translation API, webtoon images)

  • Extract Naver news headlines
  • Crawl webtoon images
  • Call the Papago API
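The Papago call listed above can be sketched with the standard library alone. This is a minimal sketch, not the notebook's code: the endpoint URL, the `X-Naver-Client-Id` / `X-Naver-Client-Secret` headers, and the response layout are assumptions based on the legacy Naver Open API and may no longer match the current service. The function names and the credential values are hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumption: endpoint of the legacy Naver Open API for Papago translation.
PAPAGO_URL = "https://openapi.naver.com/v1/papago/n2mt"


def build_papago_request(text, client_id, client_secret, source="ko", target="en"):
    """Build (but do not send) a POST request asking Papago to translate text."""
    body = urllib.parse.urlencode(
        {"source": source, "target": target, "text": text}
    ).encode("utf-8")
    request = urllib.request.Request(PAPAGO_URL, data=body, method="POST")
    # Placeholder credentials; real values come from a registered application.
    request.add_header("X-Naver-Client-Id", client_id)
    request.add_header("X-Naver-Client-Secret", client_secret)
    request.add_header(
        "Content-Type", "application/x-www-form-urlencoded; charset=UTF-8"
    )
    return request


def parse_papago_response(raw_json):
    """Extract the translated string from a response body (assumed JSON layout)."""
    payload = json.loads(raw_json)
    return payload["message"]["result"]["translatedText"]


def translate(text, client_id, client_secret):
    """Send the request; requires network access and valid credentials."""
    request = build_papago_request(text, client_id, client_secret)
    with urllib.request.urlopen(request) as response:
        return parse_papago_response(response.read().decode("utf-8"))
```

Keeping request construction and response parsing separate from the network call makes both halves easy to check offline, for example against a saved response body.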

1-2. Nhn 뉴스제목추출_웹툰 이미지 다운로드.ipynb (news headline extraction, webtoon image download)

  • Extract Naver news headlines
  • Download webtoon images
  • Download every image on a specific webtoon page
  • Find the titles and URLs of the recommended webtoons on the Naver Webtoon home page
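Downloading every image on a page, as described above, comes down to collecting each `<img>` source, resolving it against the page URL, and saving the bytes. The sketch below uses the standard library's `html.parser` in place of the notebook's own parsing code, which is not shown here. The class and function names are hypothetical, and sending `Referer` and `User-Agent` headers is an assumption about what some image hosts require.

```python
import urllib.request
from html.parser import HTMLParser
from pathlib import Path, PurePosixPath
from urllib.parse import urljoin, urlparse


class ImageCollector(HTMLParser):
    """Collect the absolute URL of every <img> tag that has a src attribute."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.image_urls = []

    def handle_starttag(self, tag, attrs):
        if tag != "img":
            return
        src = dict(attrs).get("src")
        if src:
            # Resolve relative paths against the page URL.
            self.image_urls.append(urljoin(self.base_url, src))


def find_image_urls(html, base_url):
    """Return the image URLs found in an HTML string, in document order."""
    collector = ImageCollector(base_url)
    collector.feed(html)
    return collector.image_urls


def local_name(image_url):
    """Derive a local file name from the URL path, ignoring the query string."""
    return PurePosixPath(urlparse(image_url).path).name


def download_images(image_urls, page_url, out_dir="images"):
    """Save each image under out_dir; requires network access."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for url in image_urls:
        # Assumption: the host may reject requests without these headers.
        request = urllib.request.Request(
            url, headers={"Referer": page_url, "User-Agent": "Mozilla/5.0"}
        )
        with urllib.request.urlopen(request) as response:
            (out / local_name(url)).write_bytes(response.read())
```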

2. 기상청 날씨 데이터 추출.ipynb (Korea Meteorological Administration weather data extraction)

  • Fetch weather data using BeautifulSoup

3. 멜론100Chart 수집 분석 저장.ipynb (Melon Top 100 chart: collect, analyze, save)

  • Extract the titles and song IDs of the 100 songs (using regular expressions)
  • Request the detail page 100 times, once per song ID, and extract each song's details
  • Save the details of the 100 songs to a JSON file
  • Load the JSON file into a pandas DataFrame to build tabular data
  • Save the tabular data to MariaDB
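The pipeline above (regex extraction, JSON file, database table) can be sketched end to end with the standard library. This is a sketch under stated assumptions: the sample markup and the `goSongDetail('…')` link format are hypothetical stand-ins for the real chart page, the table and column names are invented for illustration, and `sqlite3` stands in for MariaDB so the example runs without a server.

```python
import json
import re
import sqlite3

# Hypothetical chart markup; the real page layout may differ.
SAMPLE_HTML = """
<tr><td><a href="javascript:goSongDetail('1001');">First Song</a></td></tr>
<tr><td><a href="javascript:goSongDetail('1002');">Second Song</a></td></tr>
"""

# Capture the numeric song ID and the link text that follows it.
SONG_PATTERN = re.compile(r"goSongDetail\('(\d+)'\);[^>]*>([^<]+)<")


def extract_songs(chart_html):
    """Return a list of {"song_id", "title"} dicts found in the chart HTML."""
    return [
        {"song_id": song_id, "title": title.strip()}
        for song_id, title in SONG_PATTERN.findall(chart_html)
    ]


def save_json(songs, path):
    """Write the song list to a UTF-8 JSON file, keeping non-ASCII titles readable."""
    with open(path, "w", encoding="utf-8") as handle:
        json.dump(songs, handle, ensure_ascii=False, indent=2)


def load_json(path):
    """Read the song list back from the JSON file."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)


def save_to_db(songs, connection):
    """Store the songs in a table (sqlite3 here as a stand-in for MariaDB)."""
    connection.execute(
        "CREATE TABLE IF NOT EXISTS songs (song_id TEXT PRIMARY KEY, title TEXT)"
    )
    connection.executemany(
        "INSERT OR REPLACE INTO songs (song_id, title) VALUES (:song_id, :title)",
        songs,
    )
    connection.commit()
```

With pandas installed, `pandas.read_json(path)` would load the same file into a DataFrame for the tabular step.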

4. 행정구역정보분석_시각화.ipynb (administrative district data analysis and visualization)

  • Read and analyze administrative district data (CSV)
  • Read the CSV with pandas and experiment with the module

5. 국회의원현황 스크래핑 분석 시각화 저장.ipynb (National Assembly members: scrape, analyze, visualize, save)

  • Scrape with BeautifulSoup
  • Extract National Assembly members' names and IDs
  • Build tabular data from the detail information
  • Visualize (bar chart, histogram, pie chart, heatmap)
  • Save to the members table in the database (pymysql, sqlalchemy)

6. seoul-bike-01.ipynb

  • Analyze the rental history of Ttareungi (따릉이), the Seoul Metropolitan Government's public bicycle service
  • pandas, plotnine, missingno, matplotlib, ggplot2, folium

7. seoul-bike-01.html

  • The Ttareungi notebook converted to HTML so that the visualizations display properly