This document summarizes a research paper that proposes a two-level feature extraction method using both syntax and semantic algorithms to improve fault diagnosis of aerospace systems. At the syntax level, an improved Chi-squared statistic called ICHI is used to select features and address issues with unbalanced data sets. At the semantic level, a topic modeling approach called PLDA that incorporates prior domain knowledge into LDA is used to further extract features. The extracted features from both levels are then combined to boost the performance of support vector machine classification, especially for minority fault categories.