jagomart
digital resources
picture1_Study Pdf 88850 | Data Warehousing And Data Mining (r18a0524)


 125x       Filetype PDF       File size 2.69 MB       Source: mrcet.com


File: Study Pdf 88850 | Data Warehousing And Data Mining (r18a0524)
digital notes on data warehousing and data mining r18a0524 b tech iii year ii sem 2020 21 department of information technology malla reddy college of engineering technology autonomous institution ugc ...

icon picture PDF Filetype PDF | Posted on 15 Sep 2022 | 3 years ago
Partial capture of text on file.
           
                                                    DIGITAL NOTES 
                                                                  ON 
                       DATA WAREHOUSING AND DATA MINING 
                                                      (R18A0524) 
           
                                         B.TECH III Year - II Sem 
                                                        (2020-21) 
           
           
           
           
           
           
           
                                                                               
           
           
                      DEPARTMENT OF INFORMATION TECHNOLOGY 
           
           
            MALLA REDDY COLLEGE OF ENGINEERING &TECHNOLOGY 
                                    (Autonomous Institution – UGC, Govt. of India) 
                                                 Sponsored by CMR Educational Society 
                             (Affiliated to JNTU, Hyderabad, Approved by AICTE- Accredited by NBA& NAAC–‘A’Grade-ISO9001:2008Certified) 
                                   Maisammaguda,Dhulapally(PostViaHakimpet),Secunderabad–500100,TelanganaState,India. 
                              Contact Number: 040-23792146/64634237, E-Mail ID: mrcet2004@gmail.com, website: www.mrcet.ac.in 
           
           
            
            
                                     MALLA REDDY COLLEGE OF ENGINEERING & TECHNOLOGY 
                                                  DEPARTMENT OF INFORMATION TECHNOLOGY 
            
                                                                                  SYLLABUS 
                    III Year B. Tech. IT –II Sem                                                                              L          T/P/ C 
                                                                                                                              3        -/- /-     3 
            
                                      (R18A0524) DATA WAREHOUSING AND DATA MINING 
            
                  Objectives: 
             1.  Study data warehouse principles and its working 
             2.  Learn Data mining concepts and understand Association Rule Mining 
             3.  Study Classification Algorithms 
             4.  Gain knowledge of how data is grouped using clustering techniques. 
            
            
               UNIT-I 
               Data warehouse: Introduction to Data warehouse, Difference between operational database systems and 
               data  warehouses,  Data warehouse  Characteristics,  Data  warehouse  Architecture  and  its  Components, 
               Extraction-Transformation-Loading,  Logical(Multi-Dimensional),  Data  Modeling,  Schema  Design,  Star 
               and Snow-Flake Schema, Fact Constellation, Fact Table, Fully Addictive, Semi-Addictive, Non Addictive 
               Measures;  Fact-Less-Facts,  Dimension  Table Characteristics;  OLAP  Cube,  OLAP  Operations,  OLAP 
               Server Architecture-ROLAP, MOLAP and HOLAP. 
            
               UNIT-II 
               Introduction: Fundamentals of data mining, Data Mining Functionalities, Classification of Data Mining 
               systems,  Data  Mining  Task  Primitives,  Integration of  a  Data  Mining  System  with  a  Database  or  Data 
               Warehouse System, Major issues in Data Mining. 
               Data Preprocessing: Need for Preprocessing the Data, Data Cleaning, Data Integration &Transformation, 
               Data Reduction, Discretization and Concept Hierarchy Generation. 
            
               UNIT-III 
               Association Rules: Problem Definition, Frequent Item Set Generation, The APRIORI Principle, Support 
               and Confidence Measures, Association Rule Generation; APRIOIRI Algorithm, The Partition Algorithms, 
               FP-Growth Algorithms, Compact Representation  of Frequent Item Set-  Maximal Frequent Item Set,  Closed 
               Frequent Item Set. 
            
            
     UNIT-IV 
     Classification: Problem Definition, General Approaches to solving a classification problem, Evaluation of 
     Classifiers , Classification techniques, Decision Trees-Decision tree Construction, Methods for Expressing 
     attribute test conditions, Measures for Selecting the Best Split, Algorithm for Decision tree Induction ; 
     Naive-Bayes Classifier,  Bayesian  Belief  Networks;  K-  Nearest  neighbor  classification-Algorithm  and 
     Characteristics. 
     Prediction: Accuracy and Error measures, Evaluating the accuracy of classifier or a predictor, Ensemble 
     methods 
     
     UNIT-V 
     Clustering: Clustering Overview, A  Categorization  of  Major  Clustering Methods, Partitioning  Methods, 
     Hierarchical  Methods,  ,  Partitioning  Clustering-K-Means  Algorithm,  PAM  Algorithm;  Hierarchical 
     Clustering-Agglomerative  Methods  and  divisive  methods,  Basic  Agglomerative  Hierarchical  Clustering 
     Algorithm, Key Issues in Hierarchical Clustering, Strengths and Weakness, Outlier Detection. 
     
     TEXT BOOKS: 
     1)  Data Mining- Concepts and -1.chniques- Jiawei Han, Micheline Kamber, Morgan Kaufmann 
       Publishers, Elsevier, 2 Edition, 2006. 
     2)  Introduction to Data Mining, Psng-Ning Tan, Vipin Kumar, Michael Steinbanch, Pearson Educatior. 
     
     REFERENCE BOOKS: 
     1)  Data Mining Techniques, Arun KPujari, 3rd Edition, Universities Press. 
     2)  Data Warehousing Fundament's, Pualraj Ponnaiah, Wiley Student Edition. 
     3)  The Data Warehouse Life CycleToolkit — Ralph Kimball, Wiley Student Edition. 
     4)  Data Mining, Vikaram Pudi, P Rddha Krishna, Oxford University Press 
     
     Outcomes: 
     •  Comparison of  functional differences between data warehouse and database systems. 
     •  Ability to perform the pre-processing of data and apply mining techniques on it. 
     •  Capability to identify the association rules, classification and clusters in large data sets. 
     •  Skills to solve real world problems in business and scientific information using data mining.
     
     
          
          
                               MALLA REDDY COLLEGE OF ENGINEERING & TECHNOLOGY 
                                         DEPARTMENT OF INFORMATION TECHNOLOGY 
                                                                             INDEX 
          
                                Unit                                Contents                                 Pg.No 
                                         Introduction to Data warehouse                                         1 
                                         Data warehouse Design and Architecture                                 2 
                                  I      Data warehouse Modelling,                                              3 
                                         Schema Design                                                          6 
                                         Measures                                                               9 
                                         OLAP                                                                  10 
                                         Fundamentals of data mining                                           12 
                                         Data Mining Functionalities                                           13 
                               
                                  II     Classification of Data Mining                                         16 
                                         Major Issues in  Data Mining                                          19 
                                         Data Preprocessing                                                    23 
                                         Association Rule Mining                                               26 
                                         Frequent Item set generation                                          29 
                               
                                 III     Apriori Algorithm                                                     30 
                                         FP growth Algorithm                                                   34 
                                         Compact Representation of Frequent Item set                           37 
                                         Classification : General approaches                                   43 
                                         Decision Tree Algorithm                                               45 
                               
                                 IV      Naïve Bayes Classifier                                                49 
                                        K-Nearest Neighbor classification                                      56 
                                         Prediction: Accuracy & Error Methods                                  60 
                                         Ensemble methods                                                      62 
                                         Clustering Overview                                                   64 
                                         A categorization of major Clustering Methods                          67 
                               
                                  V      Partitioning clustering_ K-Means Algorithm                            71 
                                         Hierarchical Clustering                                               76 
                                         Outlier Detection                                                     78 
          
          
The words contained in this file might help you see if this file matches what you are looking for:

...Digital notes on data warehousing and mining ra b tech iii year ii sem department of information technology malla reddy college engineering autonomous institution ugc govt india sponsored by cmr educational society affiliated to jntu hyderabad approved aicte accredited nba naac a grade iso certified maisammaguda dhulapally postviahakimpet secunderabad telanganastate contact number e mail id mrcet gmail com website www ac in syllabus it l t p c objectives study warehouse principles its working learn concepts understand association rule classification algorithms gain knowledge how is grouped using clustering techniques unit i introduction difference between operational database systems warehouses characteristics architecture components extraction transformation loading logical multi dimensional modeling schema design star snow flake fact constellation table fully addictive semi non measures less facts dimension olap cube operations server rolap molap holap fundamentals functionalities ta...

no reviews yet
Please Login to review.