未加星标

Delimitants of Apache Pig stores

字体大小 | |
[数据库(综合) 所属分类 数据库(综合) | 发布者 店小二05 | 时间 2019 | 作者 红领巾 ] 0人收藏点击收藏

I'm using Pig Latin to store values from an alias into the HDFS. The alias contains a semicolon in one of its fields.

dump A; (Richard & John, 1993) (Albert, 1994)

A table that shows the data in the HDFS, but the semicolon makes John go to the next column.

| Name | Year | |--------------|------| | Richard &amp | John | | Albert | 1994 |

Trying to use store like this is also not working as expected:

STORE A INTO '/user/hive/warehouse/test.db/names' using PigStorage('\t');

but even when telling PigStore to use tab as delimiter the semicolon breaks the table data. How can I fix it?

I just locally create a file suppose a.txt and copy your data into this file.

(Richard & John, 1993) (Albert, 1994)

Now I see that your data is not in tab delimiter form and that's why it split after semicolon part.So to solve this problem i just right a query like this

data = load '/home/hduser/Desktop/a.txt' using PigStorage(','); dump data;

and my output result is this

((Richard & John, 1993)) ((Albert, 1994))

I split it using this

,

because your data looks like this delimiter.

Note: I run it my local file system.So to run it locally you must start your pig using this command pig -x local and give your relevant path.

本文数据库(综合)相关术语:系统安全软件

代码区博客精选文章
分页:12
转载请注明
本文标题:Delimitants of Apache Pig stores
本站链接:https://www.codesec.net/view/628209.html


1.凡CodeSecTeam转载的文章,均出自其它媒体或其他官网介绍,目的在于传递更多的信息,并不代表本站赞同其观点和其真实性负责;
2.转载的文章仅代表原创作者观点,与本站无关。其原创性以及文中陈述文字和内容未经本站证实,本站对该文以及其中全部或者部分内容、文字的真实性、完整性、及时性,不作出任何保证或承若;
3.如本站转载稿涉及版权等问题,请作者及时联系本站,我们会及时处理。
登录后可拥有收藏文章、关注作者等权限...
技术大类 技术大类 | 数据库(综合) | 评论(0) | 阅读(128)