The paper presents an efficient classification and dimensionality reduction approach for big data using the parallel generalized Hebbian algorithm (GHA), implemented on the Spark Radoop platform. This method is shown to outperform existing techniques in handling high-dimensional datasets, addressing the challenges posed by the massive volume and complexity of big data. The research includes extensive comparisons with other machine learning algorithms and evaluates performance using multiple large datasets.