A Novel Data Handling Technique for Wine Quality Analysis using ML Techniques
Journal Title: International Journal of Experimental Research and Review - Year 2024, Vol 45, Issue 9
Abstract
In this era, wine is a regularly redeemed beverage, and industries are seeing increased sales due to product quality certification. This research aims to identify key wine characteristics that contribute to significant outcomes through the application of machine learning classification techniques, specifically Random Forest (RF), Decision Tree (DT) and Multi-Layer Perceptron (MLP), using white and red wine datasets sourced from the UCI Machine Learning repository. This research aims to develop a multiclass classification model using machine learning (ML) to accurately assess the quality of a balanced wine dataset comprising both white and red wines. The dataset is balanced by random oversampling to avoid biases in ML techniques for the majority class obtained by the imbalanced multiclass dataset (IMD). Furthermore, we apply a Yeo-Jhonson transformation (YJT) to the datasets to reduce skewness. We validated the ML algorithm's result using a 10-fold cross-validation approach and found that RF yielded the highest overall accuracy of 93.14%, within a range of 75% to 94%. We have observed that the proposed approach for balanced white wine dataset accuracy is 93.14% using RF, 90.83% using DT, and 75.49% using MLP. Similarly, for the balanced red wine dataset, accuracy is 89.36% using RF, 85.36% using DT, and 78.00% using MLP. The proposed approach improves accuracy by RF 23%, DT 30%, and MLP 21% for the white wine dataset. Similarly, accuracy by RF remained the same, DT 10%, and MLP 22% is improved in the red wine dataset. Additionally, the proposed approach's RF, DT, and MLP yield mean squared error (MSE) values of 0.080, 0.151, and 0.443 for the white wine dataset and 0.143, 0.221, and 0.396 for the red wine dataset. We also observed that the RF accuracy for the proposed technique is the highest among all specified classifiers for white and red wine datasets, respectively.
Authors and Affiliations
Onima Tigga, Jaya Pal, Debjani Mustafi
Efficacy of Dry Needling in Enhancing Hand Function and Reducing Spasticity in MCA Stroke Patients: A Prospective Case Report
Stroke, particularly involving the middle cerebral artery (MCA), often leads to persistent disability, significantly impairing hand function and inducing severe spasticity in the forearm flexors due to complex motor path...
Evaluation of Antioxidant, Anti-inflammatory and Antimicrobial Potential of Aegel marmelos Fruit Pulp Extracts against Clinical Pathogens
In India, a wide range of medicinal plants are reported. Since ancient times, these medicinal plants have been used by people for the treatment of several diseases. Herbal medicines typically have fewer side effects comp...
Machine Learning Techniques for Medicinal Leaf Prediction and Disease Identification
Trees have been a crucial component in humans' lives for hundreds of years, providing food, shelter, and medicine. Some trees have a lot of medicinal properties that cure many diseases. In the old days, Ayurvedic methods...
Moth Bean (Vigna aconitifolia) as Potential Supplement to Evaluate the Weight Gain in Wistar Albino Rats (Rattus norvegicus)
Vigna aconitifolia is an essential crop in Indian agriculture, predominantly cultivated in India. It is acknowledged for its significant nutritional value and its affordability, making it a valuable dietary choice for in...
Effect of Cadmium Toxicity on Different Antioxidant Enzymes in Growing Wheat (Triticum aestivum L.) Seedlings
Assessing the impact of Cadmium (Cd) on plant cells requires an understanding of the defensive mechanisms and adaptive responses employed by plants to counteract the deleterious effects of Cd toxicity. Wheat seedlings we...