๐๊ด์ฌ๋ถ์ผ
- Hadoop, Spark, Kafka, Docker ...
- ๋์ฉ๋ ๋ฐ์ดํฐ ์ฒ๋ฆฌ, ๋ถ์ฐ ์์คํ , ๋ฐ์ดํฐ ๋ถ์
๐บ๋ธ๋ก๊ทธ ๋ฐ์ดํฐ ์์ง๋์ด๊ฐ ๋๊ธฐ์ํด ๊ณต๋ถํ๊ณ ์๋ ๋ชจ๋ ๊ฒ!!
๐ํ๋ก์ ํธ ์น ๊ฐ๋ฐ/๋ฐ์ดํฐ ๋ถ์/์ถ์ฒ์์คํ /๋ฅ๋ฌ๋ ๋ฑ
๐ธCS ์๋ฃ๊ตฌ์กฐ/์๊ณ ๋ฆฌ์ฆ/์ปดํจํฐ๊ตฌ์กฐ/์ด์์ฒด์ /๋คํธ์ํฌ/๋ฐ์ดํฐ๋ฒ ์ด์ค ๋ฑ
๐ง๋ ผ๋ฌธ๋ฆฌ๋ทฐ
- Piranha : Optimizing Short Jobs in Hadoop, Elmeleegy K
- Robert H Bonczek, Clyde W Holsapple, and Andrew B Whinston. Foundations of decision support systems. Academic Press, 2014.
- Yingyi Bu, Bill Howe, Magdalena Balazinska, and Michael D Ernst. Haloop: efficient iterative data processing on large clusters. Proceedings of the VLDB Endowment,
- An Experimental Comparison of Pregel-like, Systems G Han M Daudjee K Ammar KOzsu M Wang X Jin T
- Twister : A Runtime for Iterative MapReduce, Ekanayake J Li H Zhang B Gunarathne TBae S Qiu J Fox G
- The Hadoop Distributed File System, Shvachko K Kuang H Radia S Chansler
- MapReduce : Simplified Data Processing on Large Clusters, Dean J Ghemawat S
- Jeffrey Dean and Sanjay Ghemawat. Mapreduce: simplified data processing on large clusters. Communications of the aCM, 51(1):107โ113, 2008.
- Hive: a warehousing solution over a map-reduce framework. Proceedings of the VLDB
- MapReduce Online, Condie T Conway N Alvaro P Hellerstein JElmeleegy K Sears R
- PACMan: Coordinated memory caching for parallel jobs, Ananthanarayanan G Ghodsi A Wang A
- Hive: a warehousing solution over a map-reduce framework
- Resilient Distributed Datasets : A Fault-Tolerant Abstraction for In-Memory Cluster Computing, Zaharia M Chowdhury M Das T Dave A Ma JMccauley M
- Flink Forward conference in Berlin. Flink vs spark slideshare. http://www.slideshare.net/sbaltagi/flink-vs-spark? related=2.
- Resilient Distributed Datasets : A Fault-Tolerant Abstraction for In-Memory Cluster Computing, Zaharia M Chowdhury M Das T Dave A Ma JMccauley M Franklin M
- Streaming Data Analysis using Apache Cassandra and Zeppelin
- Analysis of Hadoop performance and unstructured data using Zeppelin
- Haloop efficient iterative data processing on large clusters
- iMapReduce: A Distributed Computing Framework for Iterative Computation
- Improving MapReduce Performance in Heterogeneous Environments