Downloads
rminer - A open-source library that facilitates the use of Data Mining techniques in R
This package was used in several Data Mining applications: intensive care medicine, meat and wine quality assessment, civil engineering, forest fires prediction, modeling student performance, time series forecasting, spam e-mail detection and others. Also available at Comprehensive R Archive Network (CRAN). The rminer package tutorial pdf is available here and the respective code is available here.
Springer Book: Modern Optimization with R (R code, data, ...)
Benchmark of AutoML tools: IJCNN 2021 paper results, data & code
Datasets:
Forest Fires (regression, donated to the UCI Machine Learning (ML) repository).
Wine Quality (regression/classification) donated to the UCI ML repository).
S-Enron corpus (personalized spam e-mail classification, 5 users, "Date:" field should be used to mix the ham and spam messages, see this paper: Elsevier or RepositoriUM).
Bank Marketing (classification, donated to UCI ML repository).
Internet Traffic Time Series Datasets (time series forecasting, see this paper: Wiley or RepositoriUM; also available at tsdl R package - index 643 to 648).
Input importance synthetic datasets (regression and classification, eXplainable Artificial Intelligence - XAI, see this paper: Elsevier or RepositoriUM).
Student Performance (regression/classification, donated to the UCI ML repository).
Stock Market Lexicon (with more than 20.000 microblog terms associated with positive or negative scores, available at GitHub).
Online News Popularity (regression/classification, donated to the UCI ML repository).
CS Abstracts Dataset (sequential classification).
Twitter-country-geolocation (classification).
Cross-source cross-domain sentiment analysis (classification).