jagomart
digital resources
picture1_Analysis Ppt 75510 | Datastrategynov6


 170x       Filetype PPTX       File size 0.36 MB       Source: cse.buffalo.edu


File: Analysis Ppt 75510 | Datastrategynov6
high level goals for the course 2 understand foundations of data analytics so that you can interpret and communicate results and make informed decisions study and learn to apply common ...

icon picture PPTX Filetype Power Point PPTX | Posted on 02 Sep 2022 | 3 years ago
Partial capture of text on file.
          High Level Goals for the course
                                     2
    
     Understand foundations of data analytics so that you can 
     interpret and communicate results and make informed 
     decisions 
    
     Study and learn to apply common statistical methods and  
     machine learning algorithms to solve business problems
    
     Learn to work with popular tools to analyze and visualize data; 
     more importantly encourage consistency across departments 
     on analytics/tools used
    
     Working with cloud for data storage and for deployment of 
     applications 
    
     Learn methods for mastering and applying emerging concepts 
     and technologies for continuous data-driven improvements 
     to your business processes
    
     Transform complex analytics into routine processes
    Rich's Data Analytics Training                             09/01/2022
                           Motivation
                                    3
    
      Tremendous advances have taken place in 
      statistical methods and tools, machine learning 
      and data mining approaches, and internet based 
      dissemination tools for analysis and visualization.
    
      Many tools are open source and freely available 
      for anybody to use.
    
      Is there an easy entry-point into learning these 
      technologies?
    
      Can we make these tools easily accessible to the 
      decision makers similar to how “office” 
      productivity software is used?
    Rich's Data Analytics Training                          09/01/2022
                    Newer kinds of Data
                                     4
    
     New kinds of data from different sources (see p.23 of Data 
     Science book) : tweets, geo location, emails, blogs
    
     Two major types: structured and unstructured data
    
     Structured data: data collected and stored according to well 
     defined schema;  Realtime stock quotes
    
     Unstructured data: messages from social media, news, 
     talks, books, letters, manuscripts, court documents..
    
     “Regardless of their differences, they work in tandem in any 
     effective big data operation. Companies wishing to make 
     the most of their data should use tools that utilize the 
     benefits of both.”5
    
     We will discuss methods for analyzing both structured and 
     unstructured data
    Rich's Data Analytics Training                            09/01/2022
                    Top Ten Largest Databases
          7000
          6000
          5000
           Terabytes
          4000
                          Top ten largest databases (2007)
          3000
          2000
          1000
            0
                 LOC     CIA    Amazon YOUTube ChoicePt   Sprint  Google   AT&T    NERSC    Climate
       Ref: http://www.comparebusinessproducts.com/fyi/10-largest-databases-in-the-world/
      Rich's Data Analytics Training                       5                                           09/01/2022
     Top Ten Largest Databases in 2007 vs 
                            Facebook ‘s cluster in 2010
                                                                                                        21 PetaByte
                                                                                                        In 2010
          7000
          6000
          5000
          4000
          Terabytes
          3000
                             Top ten largest databases (2007)
          2000
          1000
            0
                 LOC     CIA    Amazon YOUTube ChoicePt   Sprint  Google   AT&T    NERSC    Climate Facebook
       Ref: http://www.comparebusinessproducts.com/fyi/10-largest-databases-in-the-world
      Rich's Data Analytics Training                       6                                           09/01/2022
The words contained in this file might help you see if this file matches what you are looking for:

...High level goals for the course understand foundations of data analytics so that you can interpret and communicate results make informed decisions study learn to apply common statistical methods machine learning algorithms solve business problems work with popular tools analyze visualize more importantly encourage consistency across departments on used working cloud storage deployment applications mastering applying emerging concepts technologies continuous driven improvements your processes transform complex into routine rich s training motivation tremendous advances have taken place in mining approaches internet based dissemination analysis visualization many are open source freely available anybody use is there an easy entry point these we easily accessible decision makers similar how office productivity software newer kinds new from different sources see p science book tweets geo location emails blogs two major types structured unstructured collected stored according well defined s...

no reviews yet
Please Login to review.