union:
合并两个RDD
示例代码:
def my_union(): data = [1,2,3,4,5] rdd1 =sc.parallelize(data) data2 = [6,7,8,9,10] rdd2 = sc.parallelize(data2) result = rdd1.union(rdd2)
print(result.collect())
输出结果:
[1, 2, 3, 4, 5, 6, 7, 8, 9, 10]