OneClick Mining

Software GUI

Description of the GUI elements.

Summary of our project

Over the last few years, computer storage capacity has skyrocketed, and gathering huge quantities of information has become routine. Such data is now easy to access: regional temperatures around the world over the last ten years, purchases made by customers in shops, the results of a survey on any issue, etc. However, this data is raw and unprocessed, often unlabeled or poorly labeled, even though it contains a large amount of important information.

Schematic of the different steps of data mining

Data mining is a complex process. Data is first gathered, then processed to keep only the most relevant parts: these are the selection step and the preprocessing step. The extracted data is then used as input for several algorithms during the mining step. The results are processed once more during the postprocessing step, so that only the relevant ones, structured as patterns, are kept. The mining step is the difficult one: the most appropriate algorithms must be selected and their parameters chosen so that the algorithms perform well and give results that are interesting to the user.
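
The four steps described above can be illustrated with a minimal Python sketch. It is only a toy example and not the project's code: the function names, the sample records, and the counting "algorithm" used for the mining step are all assumptions made for illustration.

```python
from collections import Counter


def select(records, columns):
    """Selection step: keep only the columns of interest."""
    return [{c: r.get(c) for c in columns} for r in records]


def preprocess(records):
    """Preprocessing step: drop records that have a missing value."""
    return [r for r in records if all(v is not None for v in r.values())]


def mine(records):
    """Mining step (toy algorithm): count each (attribute, value) pair."""
    return Counter((k, v) for r in records for k, v in r.items())


def postprocess(counts, n_records, min_support):
    """Postprocessing step: keep pairs whose relative frequency is high enough."""
    return {
        item: count / n_records
        for item, count in counts.items()
        if count / n_records >= min_support
    }


def run_pipeline(records, columns, min_support=0.5):
    """Chain the four steps; min_support is a hypothetical parameter."""
    clean = preprocess(select(records, columns))
    if not clean:
        return {}
    return postprocess(mine(clean), len(clean), min_support)


# Hypothetical raw data: purchases with an irrelevant "note" column.
records = [
    {"shop": "A", "item": "bread", "note": "x"},
    {"shop": "A", "item": "milk", "note": None},
    {"shop": "B", "item": "bread", "note": "y"},
    {"shop": "A", "item": None, "note": "z"},
]
patterns = run_pipeline(records, ["shop", "item"])
```

Even in this toy version, the outcome depends on a parameter (`min_support`) that the user would have to choose, which is the difficulty the mining step raises.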

This fourth-year student project is a data mining software designed for a user with no experience in data mining. The user only needs to press a single button to get results. This concept is called OneClick Mining and is presented in the research article (1) One Click Mining - Interactive Local Pattern Discovery through Implicit Preference and Performance Learning.

Learning cycle and Mining cycle

Diagram of the learning cycle and the mining cycle

General operation

The diagram describes how the OneClick Mining software works overall. It consists of the user-facing part presented earlier and of the internal workings.
Each time the user clicks the Mining button, the utility function is updated from the list of patterns the user judged interesting and the list of patterns the user deleted. This function can be seen as a snapshot of the user's preferences. Applied to a pattern and its interestingness measures, it indicates whether, according to that snapshot, the software expects the user to find the pattern interesting. Interestingness measures are values that describe the associated pattern and evaluate its relevance according to several criteria, such as the number of attributes. Many measures exist, and which one is used depends on the algorithm. Because the function is recomputed at each click on the Mining button, it always reflects the user's feedback on the patterns shown during the previous round.
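
As a rough illustration of this update, the sketch below models the utility function as a weighted sum of interestingness measures and nudges the weights toward kept patterns and away from deleted ones. This is an assumption made for the example, not the learning method of the software or of article (1); the names `utility` and `update_utility`, the learning rate, and the two sample measures are hypothetical.

```python
def utility(weights, measures):
    """Score a pattern from its interestingness measures (weighted sum)."""
    return sum(w * m for w, m in zip(weights, measures))


def update_utility(weights, kept, deleted, learning_rate=0.1):
    """Return new weights after one click on the Mining button.

    kept    -- measure vectors of patterns the user judged interesting
    deleted -- measure vectors of patterns the user deleted
    """
    new_weights = list(weights)
    for measures in kept:
        for i, m in enumerate(measures):
            new_weights[i] += learning_rate * m  # reward what was kept
    for measures in deleted:
        for i, m in enumerate(measures):
            new_weights[i] -= learning_rate * m  # penalize what was deleted
    return new_weights


# Hypothetical measures: (frequency, normalized number of attributes).
weights = update_utility(
    [0.0, 0.0],
    kept=[(0.8, 0.2)],
    deleted=[(0.1, 0.9)],
)
likes_frequent = utility(weights, (0.9, 0.1)) > 0
likes_complex = utility(weights, (0.2, 0.8)) > 0
```

With this feedback, a frequent pattern with few attributes gets a positive score and a rare pattern with many attributes gets a negative one, which is the "snapshot" of the user's preferences at that click.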
A new learning cycle is then launched. During this learning cycle, numerous mining cycles run: several data mining algorithms are launched one after another, with only one algorithm per mining cycle. These algorithms produce patterns that are shown to the user at the next click on the Mining button. In OneClick Mining, a pattern is a pair of values: the pattern itself and its interestingness measures.
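
The relationship between a learning cycle and its mining cycles might be sketched as follows. The round-robin order, the two toy algorithms, and the names `Pattern` and `learning_cycle` are assumptions for illustration; the text does not say how the software picks the algorithm for each mining cycle.

```python
from itertools import cycle
from typing import NamedTuple, Tuple


class Pattern(NamedTuple):
    """A pattern as a pair: the pattern itself and its interestingness measures."""
    description: str
    measures: Tuple[float, ...]


def most_common(data):
    """Toy algorithm: report the most frequent value and its frequency."""
    value = max(set(data), key=data.count)
    return [Pattern(f"value {value} is frequent", (data.count(value) / len(data),))]


def value_range(data):
    """Toy algorithm: report the range of the values (dummy measure 1.0)."""
    return [Pattern(f"values lie in [{min(data)}, {max(data)}]", (1.0,))]


def learning_cycle(algorithms, data, n_mining_cycles):
    """Run several mining cycles, launching exactly one algorithm per cycle."""
    candidates = []
    schedule = cycle(algorithms)  # assumed round-robin order
    for _ in range(n_mining_cycles):
        algorithm = next(schedule)
        candidates.extend(algorithm(data))
    return candidates  # shown to the user at the next click on Mining


candidates = learning_cycle([most_common, value_range], [1, 2, 2, 3, 2], 3)
```

Here three mining cycles run `most_common`, `value_range`, then `most_common` again, and the collected patterns wait for the next click.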

Our Team

Laurence ROZE

Ibamar BA

Francesco BARIATTI

Pierre Nicolas EUDE

Violaine FABRY

Gregrory MARTIN

Marie LOUP

Louis-Marie RENAUD
