问题：

Flume不适用于HDFS

张翰音

2023-03-14

我用两个节点设置水槽。我想将数据从slave01加载到hdfs。slave01:示例配置属性

 agent.sources = baksrc
agent.channels = memoryChannel
agent.sinks =avro-forward-sink 
agent.sources.baksrc.type = exec
agent.sources.baksrc.command = tail -F /root/hadoo/test/data.txt
agent.sources.baksrc.checkperiodic = 1000
agent.sources.baksrc.channels =memoryChannel 
agent.channels.memoryChannel.type = memory
agent.channels.memoryChannel.keep-alive = 30
agent.channels.memoryChannel.capacity = 10000
agent.channels.memoryChannel.transactionCapacity = 10000
agent.sinks.avro-forward-sink.type = avro
agent.sinks.avro-forward-sink.hostname = master
agent.sinks.avro-forward-sink.port = 23004
agent.sinks.avro-forward-sink.channel = memoryChannel

master:示例配置属性

agent.sources = avrosrc
agent.sinks =hdfs-write 
agent.channels = memoryChannel
agent.sources.avrosrc.type =avro
agent.sources.avrosrc.bind =master
agent.sources.avrosrc.port =23004
agent.sources.avrosrc.channels=memoryChannel
agent.channels.memoryChannel.type = memory
agent.channels.memoryChannel.keep-alive = 30
agent.channels.memoryChannel.capacity = 10000
agent.channels.memoryChannel.transactionCapacity =10000
agent.sinks.hdfs-write.type = hdfs
agent.sinks.hdfs-write.hdfs.path =hdfs://172.16.86.38:9000/flume/webdata
agent.sinks.hdfs-write.hdfs.rollInterval = 0   
agent.sinks.hdfs-write.hdfs.rollSize = 4000000  
agent.sinks.hdfs-write.hdfs.rollCount = 0   
agent.sinks.hdfs-write.hdfs.writeFormat = Text  
agent.sinks.hdfs-write.hdfs.fileType = DataStream  
agent.sinks.hdfs-write.hdfs.batchSize = 10  
agent.sinks.hdfs-write.channel=memoryChannel

然后我运行一个shell脚本：如下所示：

#!/bin/sh
for i in {1..1000000}; do
    echo "test flume to Hbase $i" >>/root/hadoop/test/data.txt; 
    sleep 0.1; 
done

启动 flume： flume-ng agent --conf conf conf --conf-file example-conf.properties --name agent -dflume.root.logger=DEBUG，console 我在控制台上没有错误。

14/05/06 16:38:44 INFO source.AvroSource: Avro source avrosrc stopping: Avro source avrosrc: { bindAddress: master, port: 23004 }
14/05/06 16:38:44 INFO ipc.NettyServer: [id: 0x49f2de1b, /172.16.86.39:9359 :> /172.16.86.38:23004] DISCONNECTED
14/05/06 16:38:44 INFO ipc.NettyServer: [id: 0x49f2de1b, /172.16.86.39:9359 :> /172.16.86.38:23004] UNBOUND
14/05/06 16:38:44 INFO ipc.NettyServer: [id: 0x49f2de1b, /172.16.86.39:9359 :> /172.16.86.38:23004] CLOSED

但是我看不到hdfs中的文件，我的配置有问题吗？？我已经在master上测试过了，效果很好。

共有1个答案

堵凯

2023-03-14

您使用哪个版本的水槽？
你设置HADOOP_HOME了吗？
Flume是否从HADOOP_HOME输出带有Hadoop jar的类路径？
如果您使用的是Apache Flume，那么一步一步： 1. 设置HADOOP_HOME
2. 编辑 Hadoop核心站点.xml，确保 namenode IP 正确。
3. 使用 HDFS 路径：agent.sinks.hdfs-write.hdfs.path =/flume/webdata
4. 启动水槽

类似资料：

自定义拦截器不适用于Apache Flume

我有一个Flume组件在监听Syslog流。我做了一个自定义的拦截器来修改调用，但它不起作用。我做错了什么？谢谢你，Andrea 拦截器是一个编译良好的JAR文件，位于@FLUME_HOME/bin目录中系统将事件记录在文件中而不修改它们，这是相关的DEBUG日志：
PhantomJSDriver适用于HTTP，但不适用于HTTPS

问题内容：嗨，我只是简单地尝试在www.example.com上获取h1标签，该标签显示为“ Example Domain”。该代码适用于http://www.example.com，但不适用于https://www.exmaple.com。我该如何解决这个问题？谢谢问题答案： PhantomJSDriver不支持（所有）DesiredCapabilities。你会需要：记录在这里：htt
BufferedWriter适用于Windows，但不适用于Mac

所以我使用这种方法写入文件，它在windows上运行完全正常，但在mac上运行时，它会创建文件，但它们是空的。我知道数据是正确的，因为它打印正确。感谢您的任何帮助，这真的让我绊倒了。
查询适用于postgresql，但不适用于hsqldb

列名称的类型为int[] 上述查询适用于postgresql，但不适用于hsqldb，甚至适用于sql 尝试的hsqldb版本：2.2.9和2.3.0 在hsqldb中工作的sql是从table_name中选择x，unnest（column_name）y（x）x和y不是该表的列。
MFP 8.0 API适用于POSTMAN，但不适用于AJAX

我能够成功地打电话给邮递员： /mfp/api/az/v1/token和 /mfpadmin/management-apis/2.0/runtimes/mfp/applications 我正在获取从/mfp/api/az/v1/token接收的承载令牌，并将其添加到/mfp/applications的授权标头中。我收到了来自两者的200个响应，并从每个API中获取了预期的信息。然后，我选择从P
SonarRunner适用于Sonar 3.2，但不适用于SonarQube 4.4

我一直在使用声纳3.2 同样的配置，当我升级到SonarQube 4.4时声纳项目属性: 请帮助整理这些例外问候， KP

Flume不适用于HDFS

共有1个答案

相关问答

相关文章

相关阅读

相关工具

相关文档