基于分块DCT的视频文字检测算法

刘凌霞1，牛红惠1，崔洲涓2

doi:10.3969/j.issn.1007130X.2011.

计算机工程与科学 >

2011 , Vol. 33 >Issue 6: 63 - 66

DOI: https://doi.org/10.3969/j.issn.1007130X.2011.

论文

基于分块DCT的视频文字检测算法

展开

（1.安阳师范学院计算机教学部，河南安阳 455000;
2.西安理工大学机械与精密仪器工程学院，陕西西安 710048）

刘凌霞(1977),女,河南安阳人，硕士，讲师，研究方向为图像图形处理和数据挖掘。牛红惠(1972),女,河南濮阳人，硕士,讲师,研究方向为计算机网络、神经网络算法及应用和数据挖掘。崔洲涓(1986),女,河南安阳人，硕士生，研究方向为光通信和FPGA设计。

收稿日期: 2010-12-01

修回日期: 2011-02-26

网络出版日期: 2011-06-25

收起

A Novel DCTBased Video Text Detection Algorithm

Expand

(1.Computer Education Department,Anyang Normal University,Anyang 455000;
2.School of Mechanical and Instrumental Engineering,Xi’an University of Technology,Xi’an 710048,China)

Received date: 2010-12-01

Revised date: 2011-02-26

Online published: 2011-06-25

Fold

摘要

针对大量视频图像中出现的各种文字信息，本文提出了一种基于离散余弦变换（DCT）的文字提取算法。该方法首先将图像分割为等大小基本块，然后对各小块提取DCT特征。在此基础上，利用图像对比度，设计了一种动态阈值分割方法，可将文字信息和背景信息进行分离。然后依据最小外接矩形算法，获得初始文字检测结果。最终使用Voronoi Diagram算法对初始区域进行合并得到最终文字区域检测结果。算法可以快速而精确定位文字所对应的区域，并且能适用于各种背景条件下的视频图像。

关键词： 视频图像; 文字识别; 检测; 离散余弦变换; 结构分析

本文引用格式

刘凌霞1，牛红惠1，崔洲涓2 . 基于分块DCT的视频文字检测算法[J]. 计算机工程与科学, 2011 , 33(6) : 63 -66 . DOI: 10.3969/j.issn.1007130X.2011.

Abstract

To help users navigate the libraries of video, algorithms that automatically index video based on the content are needed. In this paper, we present a DCT based approach to detect texts and captions from the videos. The use of these features is in a flexible manner thus can be adapted to different applications. Language independence is an important advantage of the proposed method. Experiments are conducted on a large volume of real video shots. Solutions are proposed for each of these problems and compared with the existing work found in the literature.

Key words： video;text recognition;detection;DCT;structural analysis

参考文献

［1］郑翠翠, 王兴起. 基于边缘信息和局部直方图的视频文字检测法［J］. 机电工程,2009,26(10):3133.
［2］黄剑, 赵黎, 杨士强. 视频文字检测与多尺度定位算法［J］. 清华大学学报(自然科学版), 2004,44(1):5053.
［3］Antani S, Crandall D, Narasimamurthy A, et al. Evaluation of Methods for Extraction of Text from Video［C］∥Proc of IAPR Int’l Workshop on Document Analysis Systems, 2000:507514.
［4］黄旭, 陈驰, 曾江源, 等.基于离散余弦变换的遥感影像纹理分类［J］. 江苏科技信息，2009 (6).
［5］Kise K, Sato A,Iwata M. Segmentation of Page Images Using the Area Voronoi Diagram［J］. Computer Vision and Image Understanding, 1998,70(3):370382.

Options

文章导航

模态框（Modal）标题

摘要

本文引用格式

Abstract

参考文献