Exploring cGANs for Urdu Alphabets and Numerical System Generation
Journal Title: International Journal of Innovations in Science and Technology - Year 2025, Vol 7, Issue 5
Abstract
Urdu ligatures play a crucial role in text representation and processing, especially in Urdu language applications. While handwritten characters have been studied extensively in many languages, raster-based generated images of Urdu characters remain largely unexplored. This paper presents a generative model designed to produce high-quality samples that closely resemble, yet are not copies of, the images in existing datasets. Built on Generative Adversarial Networks (GANs), the model is trained on a diverse dataset comprising 40 classes of Urdu alphabets and 20 classes of numerals (both modern and Arabic-style), with each class containing 1,000 augmented images to capture variations. The generator network creates synthetic Urdu character samples conditioned on the class, while the discriminator network evaluates how closely they resemble real samples. The model’s effectiveness is assessed using the Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index (SSIM), and Fréchet Inception Distance (FID). The results indicate that the proposed GAN-based approach achieves high fidelity and structural accuracy, making it useful for applications in text digitization and Optical Character Recognition (OCR).
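As an illustration of the first metric named in the abstract, PSNR can be computed from the mean squared error between a reference image and a generated one. The following is a minimal sketch, not the authors' evaluation code: the function name `psnr`, the use of NumPy, and the default 8-bit pixel range (`max_value=255.0`) are all assumptions made for this example.

```python
import numpy as np


def psnr(reference, generated, max_value=255.0):
    """Peak Signal-to-Noise Ratio in dB between two same-sized images.

    Hypothetical helper for illustration; max_value=255.0 assumes
    8-bit grayscale or RGB pixel values.
    """
    reference = np.asarray(reference, dtype=np.float64)
    generated = np.asarray(generated, dtype=np.float64)
    if reference.shape != generated.shape:
        raise ValueError("images must have the same shape")

    # Mean squared error over all pixels (and channels, if any).
    mse = np.mean((reference - generated) ** 2)
    if mse == 0:
        # Identical images: PSNR is unbounded.
        return float("inf")

    # PSNR = 10 * log10(MAX^2 / MSE)
    return float(10.0 * np.log10(max_value ** 2 / mse))
```

Higher values indicate that the generated image is closer, pixel by pixel, to the reference. PSNR does not capture perceptual or distribution-level quality, which is presumably why the abstract pairs it with SSIM and FID.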
Authors and Affiliations
Suleman Khalil, Syed Yasser Arafat, Fatima Bibi, Faiza Shafique
Towards End-to-End Speech Recognition System for Pashto Language Using Transformer Model
The conventional use of Hidden Markov Models (HMMs) and Gaussian Mixture Models (GMMs) for speech recognition posed setup challenges and inefficiency. This paper adopts the Transformer model for Pashto continuous sp...
A Comparative Analysis of BER Performance for NOMA in the Presence of Rayleigh Fading and Impulse Noise
Importance of Study: This research investigates the integration of wired and wireless communication in Smart Grid (SG) systems, addressing the challenges posed by impulse noise and the increasing demand for bandwidth....
Efficient Region-Based Video Text Extraction Using Advanced Detection and Recognition Models
This paper presents an automated process for extracting text from video frames by specifically targeting text-rich regions, identified through advanced scene text detection methods. Unlike traditional techniques that ap...
Artificial Intelligence-Based Approach for The Recommendations of Mango Supply Chain
This study utilizes a comprehensive dataset that encompasses variables reflecting temperature, humidity, precipitation, inventory levels, transportation modes, freshness scores, and ripeness scores. Compiled from vari...
Deep Learning-Based Image Captioning for Visual Impairment Using a VGG16 and LSTM Approach
Visually impaired people face the challenge of gathering information about their surroundings. They are unable to make sense of visually presented information such as capturing images, reading sign boards, moving aroun...