HEPATITIS C: Classification and Prediction Using Scikit-Learn, Keras, and TensorFlow with Python GUI

·
· BALIGE PUBLISHING
4.7
7 reviews
Ebook
318
Pages
Ratings and reviews aren’t verified  Learn More

About this ebook

In this comprehensive project focusing on Hepatitis C classification and prediction, the journey begins with a meticulous exploration of the dataset. Through Python, Scikit-Learn, Keras, and TensorFlow, the project aims to develop an effective model to predict Hepatitis C based on given features. The dataset's attributes are systematically examined, and their distributions are analyzed to uncover insights into potential correlations and patterns.


The subsequent step involves categorizing the feature distributions. This phase sheds light on the underlying characteristics of each attribute, facilitating the understanding of their roles in influencing the target variable. This categorization lays the foundation for feature scaling and preprocessing, ensuring that the data is optimized for machine learning.


The core of the project revolves around the development of machine learning models. Employing Scikit-Learn, various classification algorithms are applied, including K-Nearest Neighbors (KNN), Decision Trees, Random Forests, Naive Bayes, Gradient Boosting, AdaBoost, Light Gradient Boosting, Multi-Layer Perceptron, and XGBoost. The models are fine-tuned using Grid Search to optimize hyperparameters, enhancing their performance and generalization capability.


Taking the project a step further, deep learning techniques are harnessed to tackle the Hepatitis C classification challenge. A key component is the construction of an Artificial Neural Network (ANN) using Keras and TensorFlow. This ANN leverages layers of interconnected nodes to learn complex patterns within the data. LSTM, FNN, RNN, DBN, and Autoencoders are also explored, offering a comprehensive understanding of deep learning's versatility.


To evaluate the models' performances, an array of metrics are meticulously employed. Metrics such as accuracy, precision, recall, F1-score, and AUC-ROC are meticulously calculated. The significance of each metric is meticulously explained, underpinning the assessment of a model's true predictive power and its potential weaknesses. The evaluation phase emerges as a pivotal aspect, accentuated by an array of comprehensive metrics. Performance assessment encompasses metrics such as accuracy, precision, recall, F1-score, and ROC-AUC. Cross-validation and learning curves are strategically employed to mitigate overfitting and ensure model generalization. Furthermore, visual aids such as ROC curves and confusion matrices provide a lucid depiction of the models' interplay between sensitivity and specificity.


The culmination of the project involves the creation of a user-friendly Graphical User Interface (GUI) using PyQt. The GUI enables users to interact seamlessly with the models, facilitating data input, model selection, and prediction execution. A detailed description of the GUI's components, including buttons, checkboxes, and interactive plots, highlights its role in simplifying the entire classification process.


In a comprehensive journey of exploration, experimentation, and analysis, this project effectively marries data science and machine learning. By thoroughly examining the dataset, engineering features, utilizing a diverse range of machine learning models, harnessing the capabilities of deep learning, evaluating performance metrics, and creating an intuitive GUI, the project encapsulates the multi-faceted nature of modern data-driven endeavors.

Ratings and reviews

4.7
7 reviews
Anil Das
September 2, 2024
AAA
Did you find this helpful?
Anil Das
August 14, 2024
AAA
Did you find this helpful?

About the author

Vivian Siahaan is a fast-learner who likes to do new things. She was born, raised in Hinalang Bagasan, Balige, on the banks of Lake Toba, and completed high school education from SMAN 1 Balige. She started herself learning Java, Android, JavaScript, CSS, C ++, Python, R, Visual Basic, Visual C #, MATLAB, Mathematica, PHP, JSP, MySQL, SQL Server, Oracle, Access, and other programming languages. She studied programming from scratch, starting with the most basic syntax and logic, by building several simple and applicable GUI applications. Animation and games are fields of programming that are interests that she always wants to develop. Besides studying mathematical logic and programming, the author also has the pleasure of reading novels. Vivian Siahaan has written dozens of ebooks that have been published on Sparta Publisher: Data Structure with Java; Java Programming: Cookbook; C ++ Programming: Cookbook; C Programming For High Schools / Vocational Schools and Students; Java Programming for SMA / SMK; Java Tutorial: GUI, Graphics and Animation; Visual Basic Programming: From A to Z; Java Programming for Animation and Games; C # Programming for SMA / SMK and Students; MATLAB For Students and Researchers; Graphics in JavaScript: Quick Learning Series; JavaScript Image Processing Methods: From A to Z; Java GUI Case Study: AWT & Swing; Basic CSS and JavaScript; PHP / MySQL Programming: Cookbook; Visual Basic: Cookbook; C ++ Programming for High Schools / Vocational Schools and Students; Concepts and Practices of C ++; PHP / MySQL For Students; C # Programming: From A to Z; Visual Basic for SMA / SMK and Students; C # .NET and SQL Server for High School / Vocational School and Students. At the ANDI Yogyakarta publisher, Vivian Siahaan also wrote a number of books including: Python Programming Theory and Practice; Python GUI Programming; Python GUI and Database; Build From Zero School Database Management System In Python / MySQL; Database Management System in Python / MySQL; Python / MySQL For Management Systems of Criminal Track Record Database; Java / MySQL For Management Systems of Criminal Track Records Database; Database and Critptography Using Java / MySQL; Build From Zero School Database Management System With Java / MySQL.

Rismon Hasiholan Sianipar was born in Pematang Siantar, in 1994. After graduating from SMAN 3 Pematang Siantar 3, the writer traveled to the city of Jogjakarta. In 1998 and 2001 the author completed his Bachelor of Engineering (S.T) and Master of Engineering (M.T) education in the Electrical Engineering of Gadjah Mada University, under the guidance of Prof. Dr. Adhi Soesanto and Prof. Dr. Thomas Sri Widodo, focusing on research on non-stationary signals by analyzing their energy using time-frequency maps. Because of its non-stationary nature, the distribution of signal energy becomes very dynamic on a time-frequency map. By mapping the distribution of energy in the time-frequency field using discrete wavelet transformations, one can design non-linear filters so that they can analyze the pattern of the data contained in it. In 2003, the author received a Monbukagakusho scholarship from the Japanese Government. In 2005 and 2008, he completed his Master of Engineering (M.Eng) and Doctor of Engineering (Dr.Eng) education at Yamaguchi University, under the guidance of Prof. Dr. Hidetoshi Miike. Both the master's thesis and his doctoral thesis, R.H. Sianipar combines SR-FHN (Stochastic Resonance Fitzhugh-Nagumo) filter strength with cryptosystem ECC (elliptic curve cryptography) 4096-bit both to suppress noise in digital images and digital video and maintain its authenticity. The results of this study have been documented in international scientific journals and officially patented in Japan. One of the patents was published in Japan with a registration number 2008-009549. He is active in collaborating with several universities and research institutions in Japan, particularly in the fields of cryptography, cryptanalysis and audio / image / video digital forensics. R.H. Sianipar also has experience in conducting code-breaking methods (cryptanalysis) on a number of intelligence data that are the object of research studies in Japan. R.H. Sianipar has a number of Japanese patents, and has written a number of national / international scientific articles, and dozens of national books. R.H. Sianipar has also participated in a number of workshops related to cryptography, cryptanalysis, digital watermarking, and digital forensics. In a number of workshops, R.H. Sianipar helps Prof. Hidetoshi Miike to create applications related to digital image / video processing, steganography, cryptography, watermarking, non-linear screening, intelligent descriptor-based computer vision, and others, which are used as training materials. Field of interest in the study of R.H. Sianipar is multimedia security, signal processing / digital image / video, cryptography, digital communication, digital forensics, and data compression / coding. Until now, R.H. Sianipar continues to develop applications related to analysis of signal, image, and digital video, both for research purposes and for commercial purposes based on the Python programming language, MATLAB, C ++, C, VB.NET, C # .NET, R, and Java.

Rate this ebook

Tell us what you think.

Reading information

Smartphones and tablets
Install the Google Play Books app for Android and iPad/iPhone. It syncs automatically with your account and allows you to read online or offline wherever you are.
Laptops and computers
You can listen to audiobooks purchased on Google Play using your computer's web browser.
eReaders and other devices
To read on e-ink devices like Kobo eReaders, you'll need to download a file and transfer it to your device. Follow the detailed Help Center instructions to transfer the files to supported eReaders.