【论文浅读】《Traffic scene recognition based on deep CNN and VLAD spatial pyramids》_综合

创新思想：

为了提高了处理交通场景识别的能力，这篇论文将CNN的特征表示与VLAD编码方案相结合。具体是先用CNN求出区域建议算法生成的图像块的特征值，再用VLAD编码可以使图像表示的形式更紧凑。还用空间金字塔得到空间信息。

主要原理：

【论文浅读】《Traffic scene recognition based on deep CNN and VLAD spatial pyramids》
模型的示意图，用FC6表示特征，主成分分析(PCA)方法降维，K-均值聚类进行质心学习，VLAD编码，支持向量机分类器进行交通场景分类。

流程图，和上面一样。
【论文浅读】《Traffic scene recognition based on deep CNN and VLAD spatial pyramids》
训练集中的不同场景。

主要实验结果：

【论文浅读】《Traffic scene recognition based on deep CNN and VLAD spatial pyramids》
左：CNN结果直接分类；
中：CNN经过PCA、Kmeans、VLAD再分类；
右：CNN经过PCA、Kmeans、空间金字塔、VLAD再分类；

【论文浅读】《Traffic scene recognition based on deep CNN and VLAD spatial pyramids》
混淆矩阵分别为一、二、三层时对不同场景下的正确率。