当前位置：首页 > 软件库 > 程序开发 > 日志工具(Logging) >

datacube

数据立方体

授权协议 Apache

开发语言 Java

所属分类程序开发、日志工具(Logging)

软件类型开源软件

地区不详

投递者刁英朗

操作系统跨平台

开源组织无

适用人群未知

软件官网

软件文档

官方下载

软件概览

数据立方体是复杂计算的抽象。Datacube 是用 Java 实现的，可插入数据库后端支持的数据立方体。

datacube 是用来存储大数据点的聚合信息。数据立方体存储的是有趣输入数据点的子集。比如，你正在编写一个 web 服务器日志分析工具，你的输入点可能是日志行，你可能会计算每个浏览器的类型，每个浏览器的版本，操作系统类型，操作系统版本和其他属性。同时你可能会需要计算一个特定的组合计数(浏览器类型，浏览器版本，操作系统类型)， (浏览器类型，浏览器版本，操作系统类型，操作系统版本)，等等。

这对快速添加和修改计数是个很大的挑战，会浪费很多时间在数据库代码和重新用新计数器处理旧数据。而数据立方体就可以帮忙解决这些问题。

Urban Airship 使用 datacube 项目来支持他们的移动端应用的分析栈，每个节点每秒处理大约 10 K 的事件。

datacube 要求 JDK 1.6。

特性

性能：高速异步 IO 后端处理
使用 Hadoop MapReduce 进行批量加载
可插入数据库接口

datacube 暂时只支持 HBase 数据库后端。

示例：

IdService idService = new CachingIdService(5, new MapIdService());
ConcurrentMap<BoxedByteArray,byte[]> backingMap = 
        new ConcurrentHashMap<BoxedByteArray, byte[]>();
DbHarness<LongOp> dbHarness = new MapDbHarness<LongOp>(backingMap, LongOp.DESERIALIZER, 
        CommitType.READ_COMBINE_CAS, idService);
HourDayMonthBucketer hourDayMonthBucketer = new HourDayMonthBucketer();
Dimension<DateTime> time = new Dimension<DateTime>("time", hourDayMonthBucketer, false, 8);
Dimension<String> zipcode = new Dimension<String>("zipcode", new StringToBytesBucketer(), 
        true, 5);
DataCubeIo<LongOp> cubeIo = null;
DataCube<LongOp> cube;
Rollup hourAndZipRollup = new Rollup(zipcode, time, HourDayMonthBucketer.hours);
Rollup dayAndZipRollup = new Rollup(zipcode, time, HourDayMonthBucketer.days);
Rollup hourRollup = new Rollup(time, HourDayMonthBucketer.hours);
Rollup dayRollup = new Rollup(time, HourDayMonthBucketer.days);
List<Dimension<?>> dimensions =  ImmutableList.<Dimension<?>>of(time, zipcode);
List<Rollup> rollups = ImmutableList.of(hourAndZipRollup, dayAndZipRollup, hourRollup,
        dayRollup);
cube = new DataCube<LongOp>(dimensions, rollups);
cubeIo = new DataCubeIo<LongOp>(cube, dbHarness, 1, Long.MAX_VALUE, SyncLevel.FULL_SYNC);
DateTime now = new DateTime(DateTimeZone.UTC);
// Do an increment of 5 for a certain time and zipcode
cubeIo.writeSync(new LongOp(5), new WriteBuilder(cube)
        .at(time, now)
        .at(zipcode, "97201"));
// Do an increment of 10 for the same zipcode in a different hour of the same day
DateTime differentHour = now.withHourOfDay((now.getHourOfDay()+1)%24);
cubeIo.writeSync(new LongOp(10), new WriteBuilder(cube)
        .at(time, differentHour)
        .at(zipcode, "97201"));
// Read back the value that we wrote for the current hour, should be 5 
Optional<LongOp> thisHourCount = cubeIo.get(new ReadBuilder(cube)
         .at(time, HourDayMonthBucketer.hours, now)
        .at(zipcode, "97201"));
Assert.assertTrue(thisHourCount.isPresent());
Assert.assertEquals(5L, thisHourCount.get().getLong());
// Read back the value we wrote for the other hour, should be 10
Optional<LongOp> differentHourCount = cubeIo.get(new ReadBuilder(cube)
        .at(time, HourDayMonthBucketer.hours, differentHour)
        .at(zipcode, "97201"));
Assert.assertTrue(differentHourCount.isPresent());
Assert.assertEquals(10L, differentHourCount.get().getLong());
// The total for today should be the sum of the two increments
Optional<LongOp> todayCount = cubeIo.get(new ReadBuilder(cube)
        .at(time, HourDayMonthBucketer.days, now)
        .at(zipcode, "97201"));
Assert.assertTrue(todayCount.isPresent());
Assert.assertEquals(15L, todayCount.get().getLong());

使用案例

OpenDataCube安装教程（一）——依赖环境安装

一.OpenDataCube依赖环境的安装 1.miniconda安装借助miniconda安装python，便于多种Python环境的管理。 1）下载最新版本的minconda wget https://mirrors.tuna.tsinghua.edu.cn/anaconda/miniconda/Miniconda3-latest-Linux-x86_64.sh 注意：若提示wget命令不
OpenDataCube安装教程（三）——DataCube NoteBooks的安装

三、DataCube NoteBooks的安装 1.下载源码 (odc) [root@song software]# git clone https://github.com/opendatacube/datacube-notebooks.git Cloning into 'datacube-notebooks'... remote: Enumerating objects: 121, done.
DataCube安装文档

英文文档中有一些坑，最后导入数据还没搞定，以后再更新参考 http://www.ceos-cube.org/docs/installation/index.html http://datacube-core.readthedocs.io/en/latest/index.html #下载源码与相关工具 mkdir ~/Datacube sudo apt-get update sudo apt
数蓬科技的datacube打开CRT报错如何解决？

有的企业定制的datacube往往会出现CRT和SCP打不开的情况，这时候你只需将包含CRT和SCP的文件夹换一个路径就可以了，欢迎评论区留言讨论。
Data Cube介绍

以下是转自SlidesShare.com上面的一份PPT，概念讲得很简单清晰。转载于:https://www.cnblogs.com/mush0m/p/3633766.html
DataCube：idea2019找不到配置文件路径

问题描述：在DataCube中，使用idea2019版，运行mybatis-spring1.2.2项目时，在确认所有配置路径都没问题的情况下，程序找不到配置路径，所找的路径是沙箱占用空间外的存储路径。原因：沙箱设置中记录的登录账号和实际登录账号不一致。解决办法：用沙箱记录的账号登录；或者，把路径改成沙箱占用空间内的路径。
Notes of “Quotient Cube: How to Summarize the Semantics of a Data Cube”

Notes of "Quotient Cube: How to Summarize the Semantics of a Data Cube" •1. Terminology(相关术语) Roll up：向上综合 Drill down：向下细化或者向下钻取 w.r.t. ：with respect to 关于 lattice: A lattice is partially ordered set

datacube

特性

同类工具

相关阅读

相关文章

相关问答

相关文档