A Study of the Automated Programming Assessment Model for SQL Based on Structure Similarity Matching

J4, 2010, Vol. 32, Issue (11): 92-96. doi: 10.3969/j.issn.1007130X.2010.

YANG Hebiao, LIU Ling, YANG Lifan

(School of Computer Science and Telecommunications Engineering, Jiangsu University, Zhenjiang 212013, China)

Received: 2009-09-27 Revised: 2009-12-13 Online: 2010-11-25 Published: 2010-11-25
Corresponding author: YANG Hebiao
About the authors: YANG Hebiao (born 1960), male, from Zhenjiang, Jiangsu, professor; research interests: data mining, software architecture, information systems, and software engineering. LIU Ling, master's student; research interests: databases and data mining. YANG Lifan, undergraduate student.
Funding: Jiangsu Province High-Tech Research Funding Project (BG2007028)

Abstract

Abstract: Assessing SQL programming ability is difficult and prone to bias because many factors influence the result and the boundaries between grades are fuzzy. To address this, the paper proposes SQLAPAM, an automated assessment model based on structure similarity matching. The overall framework of the model combines static and dynamic assessment methods. The model normalizes and tokenizes each submitted SQL statement, converts it into an equivalent sequence of token pairs, and builds the corresponding structure tree (Stree). It then computes the similarity between this tree and the target tree using a tree edit distance algorithm improved in two respects: the cost model and the substructure contribution factor. Finally, drawing on the normal distribution, the model maps the similarity value to a score interval, uses a similarity threshold to correct the deviation introduced by the influencing factors, and outputs a quantitative assessment of the SQL program. The model is analyzed and validated experimentally on data, and a training data set is used to tune its parameters and optimize it.

Keywords: similarity analysis, automated assessment, tokenization, tree edit distance, normal distribution
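The pipeline described in the abstract (tokenize, build a structure tree, measure tree edit distance against a target, map similarity to a score) can be sketched roughly as follows. This is a minimal illustration, not the authors' SQLAPAM implementation. The token regex, the clause set, the `Node` class, and the two-level tree layout are assumptions. The distance is a Selkow-style top-down tree edit distance with unit costs, standing in for the paper's improved cost model and substructure contribution factors, which the abstract does not specify. The `to_score` mapping and its `mean`, `std`, and `threshold` values are likewise hypothetical.

```python
import re
from dataclasses import dataclass, field
from statistics import NormalDist

# Assumed token grammar: string literals, identifiers, numbers, operators.
TOKEN_RE = re.compile(
    r"'[^']*'|[A-Za-z_][\w.]*|\d+(?:\.\d+)?|<=|>=|<>|!=|[=<>(),*+\-/]"
)
# Assumed set of keywords that start a new clause subtree.
CLAUSES = {"SELECT", "FROM", "WHERE", "GROUP", "HAVING", "ORDER"}


@dataclass
class Node:
    label: str
    children: list["Node"] = field(default_factory=list)

    def size(self) -> int:
        return 1 + sum(c.size() for c in self.children)


def tokenize_sql(statement: str) -> list[str]:
    """Upper-case everything except string literals and split into tokens."""
    return [t if t.startswith("'") else t.upper()
            for t in TOKEN_RE.findall(statement)]


def build_tree(tokens: list[str]) -> Node:
    """Hypothetical structure tree: QUERY -> clause keyword -> clause tokens."""
    root = Node("QUERY")
    for tok in tokens:
        if tok in CLAUSES or not root.children:
            root.children.append(Node(tok))
        else:
            root.children[-1].children.append(Node(tok))
    return root


def tree_distance(a: Node, b: Node) -> int:
    """Top-down (Selkow-style) edit distance with unit costs.

    Children are aligned like a string edit distance; deleting or inserting
    a child costs the size of its whole subtree.
    """
    m, n = len(a.children), len(b.children)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        d[i][0] = d[i - 1][0] + a.children[i - 1].size()
    for j in range(1, n + 1):
        d[0][j] = d[0][j - 1] + b.children[j - 1].size()
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            d[i][j] = min(
                d[i - 1][j] + a.children[i - 1].size(),
                d[i][j - 1] + b.children[j - 1].size(),
                d[i - 1][j - 1]
                + tree_distance(a.children[i - 1], b.children[j - 1]),
            )
    return int(a.label != b.label) + d[m][n]


def similarity(a: Node, b: Node) -> float:
    """Normalize the distance into a similarity in (0, 1]."""
    return 1.0 - tree_distance(a, b) / (a.size() + b.size())


def to_score(sim: float, mean: float = 0.6, std: float = 0.15,
             threshold: float = 0.95) -> float:
    """Hypothetical mapping: normal CDF of the similarity, scaled to 0-100.

    Submissions at or above the similarity threshold get full marks.
    """
    if sim >= threshold:
        return 100.0
    return round(100.0 * NormalDist(mean, std).cdf(sim), 1)


target = build_tree(tokenize_sql("SELECT name FROM student WHERE age >= 18"))
submitted = build_tree(tokenize_sql("select name from student where age > 18;"))
print(tree_distance(submitted, target), to_score(similarity(submitted, target)))
```

In a real grader the default parameter values would be replaced by ones fitted to a training set, which is how the abstract describes the tuning step.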

Cite this article: YANG Hebiao, LIU Ling, YANG Lifan. A Study of the Automated Programming Assessment Model for SQL Based on Structure Similarity Matching[J]. J4, 2010, 32(11): 92-96.


