MapReduce Tutorial: Easy Way to Understand the Process

edutechional
edutechional
1.7 هزار بار بازدید - 9 سال پیش - In this DevOps guide we're
In this DevOps guide we're going to walk through a MapReduce tutorial and learn how I could potentially use it for the CronDose website. MapReduce is one of the key processes associated with hadoop and gfs. For the case study we're going to imagine that my tutorial suggestion page has millions of content suggestions (not really, but let’s pretend). It would take too long to look at each suggestion manually, so I’m going to use a mapreduce algorithm to analyze and organize the data. I'd follow the steps below: 1. Pass each suggestion to the map method to tokenize (convert each suggestion into an array of words) 2. This map method will return a series of key/value elements (“algorithms”, suggestion_1, “Ajax”, suggestion_2, etc) 3. The mapreduce framework would sort the returned values 4. The reduce method would iterate over each of the values and tally up the popularity of each word. Follow me: Twitter: https://twitter.com/jordanhudgens Instagram: https://www.instagram.com/jordanhudgens/ GitHub: http://github.com/jordanhudgens Full guide: https://www.crondose.com/2016/04/dead-simple-explanation-of-mapreduce/
9 سال پیش در تاریخ 1395/02/08 منتشر شده است.
1,735 بـار بازدید شده
... بیشتر