Дәріс №6
Тақырыбы: Data analysis. Data management
Сағат саны: 1
Тақырыптың негізгі сұрақтары/ жоспары:
Data analysis
Data Mining Bases
Дәріс тезисі:
Data analysis is a process of data research, filtration, conversion and modeling to extract the useful information and decision-making. Data analysis has a set of aspects and approaches, covers different methods in different fields of science and activities.
To create the plan of data collection it is necessary to:
Define problems and formulate research objectives.
Realize a preliminary study of an interesting subject.
Develop concepts of research.
Make detail planning of a research.
Make a selection of information sources and collection of secondary data.
Estimate the obtained data and to make a decision how primary data are needed.
Define a method of collection of primary data – inquiry, observation, experiment.
Carry out directly the collection of primary information.
Provide results of the research (presentation).
Decision trees – it the method of rules representation in hierarchical, sequential structure where a single node giving a decision corresponds to each object there.
All problems, which are solved by a tree method, can be integrated in the following three classes:
1) Data description: Decision trees allow storing information about data in the compact form, instead of them we can store a decision tree, which contains the exact description of objects.
2) Classification: Decision trees perfectly cope with tasks of classification, i.e. reference of objects to one of in advance known classes. The target variable shall have discrete values.
3) Regression: If a goal variable has continuous values, decision trees allow setting the dependence of the goal variable on independent (input) variables. For example, the tasks of numerical prediction (prediction of values of a goal variable) belong to this class.
Data Mining can consist of two or three stages.
Stage 1. Detection of regularities (free search).
Stage 2. Use of revealed regularities for prediction of unknown values (prognostic simulation).
Stage 3. Analysis of exceptions is the stage intended for detection and explanation of anomalies which are found in regularities.
Бекіту сұрақтары
What is data analysis?
What is data?
How can forecasting data methods be classified?
What is the regression in data analysis?
What is data visualization?
What is Data Mining?
What are data mining methods?
What is a decision tree?
What are the tasks of Data mining?
What are the algorithms of creation of a decision tree?
Әдебиет: Негізгі[1-5], қосымша [1-5]
Достарыңызбен бөлісу: |