BigData & NoSQL ( ).pdf - BigData NoSQL DBMSs Tecnologie delle Basi di Dati M lucidi a cura del prof Torlone(univ Roma3 Big data Why

BigData & NoSQL ( ).pdf - BigData NoSQL...

This preview shows page 1 - 10 out of 165 pages.

BigData & NoSQL DBMSs Tecnologie delle Basi di Dati M lucidi a cura del prof. Torlone (univ. Roma3)
Image of page 1
Big data? Why? Why not just data? Well, because they are: 1. Big “The greater the struggle, the more glorious the triumph.” (Butterfly Circus) 2. Necessary “It is a capital mistake to theorize before one has data.” (S. Holmes) 3. Fashionable “I always wanted to be fashionable.” (J. Malkovich) 4. Profitable “Data is a precious thing and will last longer than the systems themselves.” (T. Berners-Lee) 5. Exciting “The most exciting phrase to hear in science, is not ‘Eureka!’, but ‘That's funny’…” (I. Asimov) Tecnologie delle Basi di Dati M 2 BigData & NoSQL
Image of page 2
Goals Show state-of-the-art techniques for dealing with collections of unstructured data whose size exceeds the capacity of storage, management, and analysis typical for traditional (relational) database systems In particular: Requirements for modern applications Problems with big data Available hardware/software solutions BigData & NoSQL Tecnologie delle Basi di Dati M 3
Image of page 3
Roadmap Introduction Terminology, principal characteristics, and application samples Storing Big Data Hadoop & Map-reduce Cloud computing NoSQL DBMS …but there’s more! Big data computing (high-level tools like Pig/Hive) Big data analysis (technologies like Mahout/Open R) Applications (Semantic web/open data/social networks/genomic data) simply not enough time… BigData & NoSQL Tecnologie delle Basi di Dati M 4
Image of page 4
Big Data? Different definitions! “Big data exceeds the reach of commonly used hardware environments and software tools to capture, manage, and process it with in a tolerable elapsed time for its user population .” (Teradata Magazine article, 2011) “Big data refers to data sets whose size is beyond the ability of typical database software tools to capture, store, manage and analyze .” (The McKinsey Global Institute, 2012) “Big data is a term for data sets that are so large or complex that traditional data processing applications are inadequate .” (Wikipedia, 2016) BigData & NoSQL Tecnologie delle Basi di Dati M 5
Image of page 5
When data become “Big”? BigData & NoSQL Tecnologie delle Basi di Dati M 6 IOPS: Input/Output Operations Per Second Normal processing capability IOPS BIG DATA Data volume
Image of page 6
Some numbers How many data in the world? 800 Terabytes, 2000 160 Exabytes, 2006 (1EB = 10 18 B) 500 Exabytes, 2009 2.7 Zettabytes, 2012 (1ZB = 10 21 B) 35 Zettabytes by 2020 How much is a zettabyte? 1,000,000,000,000,000,000,000 bytes A stack of 1TB hard disks that is 25,400 km high How many data in a day? 7 TB, Twitter 10 TB, Facebook 90% of world's data: generated over last two years! BigData & NoSQL Tecnologie delle Basi di Dati M 7
Image of page 7
The three "V’s" of Big Data Not just a matter of volume… BigData & NoSQL Tecnologie delle Basi di Dati M 8
Image of page 8
What is more important?
Image of page 9
Image of page 10

You've reached the end of your free preview.

Want to read all 165 pages?

  • Summer '15
  • Books

What students are saying

  • Left Quote Icon

    As a current student on this bumpy collegiate pathway, I stumbled upon Course Hero, where I can find study resources for nearly all my courses, get online help from tutors 24/7, and even share my old projects, papers, and lecture notes with other students.

    Student Picture

    Kiran Temple University Fox School of Business ‘17, Course Hero Intern

  • Left Quote Icon

    I cannot even describe how much Course Hero helped me this summer. It’s truly become something I can always rely on and help me. In the end, I was not only able to survive summer classes, but I was able to thrive thanks to Course Hero.

    Student Picture

    Dana University of Pennsylvania ‘17, Course Hero Intern

  • Left Quote Icon

    The ability to access any university’s resources through Course Hero proved invaluable in my case. I was behind on Tulane coursework and actually used UCLA’s materials to help me move forward and get everything together on time.

    Student Picture

    Jill Tulane University ‘16, Course Hero Intern

Stuck? We have tutors online 24/7 who can help you get unstuck.
A+ icon
Ask Expert Tutors You can ask You can ask You can ask (will expire )
Answers in as fast as 15 minutes