전체 페이지뷰

2014년 12월 22일 월요일

Big Data Glossary

  1. Chapter 1 Terms

    1. Document-Oriented

    2. Key/Value Stores

    3. Horizontal or Vertical Scaling

    4. MapReduce

    5. Sharding

  2. Chapter 2 NoSQL Databases

    1. MongoDB

    2. CouchDB

    3. Cassandra

    4. Redis

    5. BigTable

    6. HBase

    7. Hypertable

    8. Voldemort

    9. Riak

    10. ZooKeeper

  3. Chapter 3 MapReduce

    1. Hadoop

    2. Hive

    3. Pig

    4. Cascading

    5. Cascalog

    6. mrjob

    7. Caffeine

    8. S4

    9. MapR

    10. Acunu

    11. Flume

    12. Kafka

    13. Azkaban

    14. Oozie

    15. Greenplum

  4. Chapter 4 Storage

    1. S3

    2. Hadoop Distributed File System

  5. Chapter 5 Servers

    1. EC2

    2. Google App Engine

    3. Elastic Beanstalk

    4. Heroku

  6. Chapter 6 Processing

    1. R

    2. Yahoo! Pipes

    3. Mechanical Turk

    4. Solr/Lucene

    5. ElasticSearch

    6. Datameer

    7. BigSheets

    8. Tinkerpop

  7. Chapter 7 NLP

    1. Natural Language Toolkit

    2. OpenNLP

    3. Boilerpipe

    4. OpenCalais

  8. Chapter 8 Machine Learning

    1. WEKA

    2. Mahout

    3. scikits.learn

  9. Chapter 9 Visualization

    1. Gephi

    2. GraphViz

    3. Processing

    4. Protovis

    5. Fusion Tables

    6. Tableau

  10. Chapter 10 Acquisition

    1. Google Refine

    2. Needlebase

    3. ScraperWiki

  11. Chapter 11 Serialization

    1. JSON

    2. BSON

    3. Thrift

    4. Avro

    5. Protocol Buffers

댓글 없음:

댓글 쓰기