[1] Jacobs A.The pathologies of big data[J].Communications of the ACM,2009,52(8):3644.
[2]Ghemawat S,Gobioff H,Leung S T.The Google file system[J].ACM SIGOPS Operating Systems Review,2003,37(5):2943.
[3]Shvachko K, Kuang H, Radia S, et al. The Hadoop distributed file system[C]∥Proc of IEEE Conference on Mass Storage Systems and Technologies,2010:110.
[4]Floratou A,Minhas U F,Ozcan F.SQLonHadoop: Full circle back to sharednothing database architectures[J].Proceedings of the VLDB Endowment,2014,7(12):12951306.
[5]Batory D S.On searching transposed files[J].ACM Transactions on Database Systems,1979,4(4):531544.
[6]Copeland G P, Khoshafian S N. A decomposition storage model[J].ACM SIGMOD Record,1985,14(4):268279.
[7]He Y, Lee R, Huai Y, et al.RCFile: A fast and spaceefficient data placement structure in MapReducebased warehouse systems[C]∥Proc of the 2011 IEEE 27th International Conference on Data Engineering,2011:11991208.
[8]Hortonworks Inc.ORC Files [EB/OL]. [20160209].ht
tps://issues.apache.org/jira/secure/attachment/12564124/OrcFileIntro.pptx.
[9]Trevni [EB/OL]. [20160209]. https://github.com/cutting/trevni.
[10]Cloudera Enterprise. Parquet[EB/OL].[20160209].https://github.com/Parquet.
[11]Chen S.Cheetah: A high performance,custom data warehouse on top of MapReduce[J].Proceedings of the VLDB Endowment,2010,3(12):14591468. |