Bookmarks for March 5th through March 12th

These are my links for March 5th through March 12th:

  • R Videos – Online instructional videos for R.
  • Flickr: Creative Commons – Many Flickr users have chosen to offer their work under a Creative Commons license, and you can browse or search through content under each type of license.
  • Welcome to Apache Hadoop! – The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing. Hadoop includes these subprojects:
    * Hadoop Common: The common utilities that support the other Hadoop subprojects.
    * Avro: A data serialization system that provides dynamic integration with scripting languages.
    * Chukwa: A data collection system for managing large distributed systems.
    * HBase: A scalable, distributed database that supports structured data storage for large tables.
    * HDFS: A distributed file system that provides high throughput access to application data.
    * Hive: A data warehouse infrastructure that provides data summarization and ad hoc querying.
    * MapReduce: A software framework for distributed processing of large data sets on compute clusters.
    * Pig: A high-level data-flow language and execution framework for parallel computation.
    * ZooKeeper: A high-performance coordination service for distributed applications.

Leave a comment

Your email address will not be published. Required fields are marked *