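Greenplum 精粹文集 (Greenplum Essentials Collection)
… scalability), Greenplum can be extended with user-defined functions (UDFs) written in a variety of development languages (I am personally a fan of Python and C, and will share more in later chapters). Once deployed to Greenplum, these functions take full advantage of instance-level parallelism. We strongly recommend moving processing logic that lives outside the database into UDFs on the MPP database, that is, In-Database processing; the gains in performance and convenience can be surprising. For example, … only a simple rewrite of the existing code is needed before deploying it to GP, where parallel computation yields a performance improvement of tens of times. In addition, GPTEXT (Lucene full-text search), Apache MADlib (open-source mining algorithms), SAS algorithm, and R are all deployed across the Greenplum cluster as UDFs, which gives them in-database parallel computing capability. One result worth sharing: SAS once ran a test performing logistic regression on 100 million records; a single minicomputer took a little over 4 hours, and by deploying to … can provide an installation guide, "Protegrity 数据库保护器 - Pivotal Greenplum" (Protegrity Database Protector - Pivotal Greenplum). That document lists the steps required to install the PEP server on the Greenplum master and nodes; it also explains how to import the UDFs Greenplum needs (user-defined functions, that is, small routines embedded in the database) to operate on data and communicate with the PEP application. To verify that the installation succeeded, check whether the Protegrity functions have been created. In addition, check pty_whoami …
64 pages | 2.73 MB | 1 year ago
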
VMware Greenplum 6 Documentation
… Modules and Extensions 367725 Resolves an issue where an issue with a PL/Python user-defined function (UDF) caused Greenplum master process reset. This fix now reports the error a Python exception. Release can fail due to lack of dependency checking in Greenplum 4 catalog tables. For example, restoring a UDF can fail if it references a custom data type that is created later in the backup file. The INTO error_table … use_container_logging. The default is no logging. The PL/Container log information is the information from the UDF that is run in the Docker container. By default, the PL/Container log information is sent to a system …
2445 pages | 18.05 MB | 1 year ago

VMware Tanzu Greenplum v6.23 Documentation
… Greenplum 4 catalog tables. For example, restoring a UDF can fail if it references a custom data type that is created later in the backup file. The INTO error_table … use_container_logging. The default is no logging. The PL/Container log information is the information from the UDF that is run in the Docker container. By default, the PL/Container log information is sent to a system … troubleshooting a PL/Container UDF, you can change the Greenplum Database log level with the SET command. You can set the parameter in the session before you run your PL/Container UDF. This example sets the …
2298 pages | 40.94 MB | 1 year ago

VMware Greenplum v6.25 Documentation
… can fail due to lack of dependency checking in Greenplum 4 catalog tables. For example, restoring a UDF can fail if it references a custom data type that is created later in the backup file. The INTO error_table … use_container_logging. The default is no logging. The PL/Container log information is the information from the UDF that is run in the Docker container. By default, the PL/Container log information is sent to a system … troubleshooting a PL/Container UDF, you can change the Greenplum Database log level with the SET command. You can set the parameter in the session before you run your PL/Container UDF. This example sets the …
2400 pages | 18.02 MB | 1 year ago

VMware Greenplum 6 Documentation
… can fail due to lack of dependency checking in Greenplum 4 catalog tables. For example, restoring a UDF can fail if it references a custom data type that is created later in the backup file. The INTO error_table … use_container_logging. The default is no logging. The PL/Container log information is the information from the UDF that is run in the Docker container. By default, the PL/Container log information is sent to a system … troubleshooting a PL/Container UDF, you can change the Greenplum Database log level with the SET command. You can set the parameter in the session before you run your PL/Container UDF. This example sets the …
2374 pages | 44.90 MB | 1 year ago

VMware Tanzu Greenplum v6.21 Documentation
… can fail due to lack of dependency checking in Greenplum 4 catalog tables. For example, restoring a UDF can fail if it references a custom data type that is created later in the backup file. The INTO error_table … use_container_logging. The default is no logging. The PL/Container log information is the information from the UDF that is run in the Docker container. By default, the PL/Container log information is sent to a system … troubleshooting a PL/Container UDF, you can change the Greenplum Database log level with the SET command. You can set the parameter in the session before you run your PL/Container UDF. This example sets the …
2025 pages | 33.54 MB | 1 year ago

VMware Tanzu Greenplum 6 Documentation
… can fail due to lack of dependency checking in Greenplum 4 catalog tables. For example, restoring a UDF can fail if it references a custom data type that is created later in the backup file. The INTO error_table … use_container_logging. The default is no logging. The PL/Container log information is the information from the UDF that is run in the Docker container. By default, the PL/Container log information is sent to a system … troubleshooting a PL/Container UDF, you can change the Greenplum Database log level with the SET command. You can set the parameter in the session before you run your PL/Container UDF. This example sets the …
2311 pages | 17.58 MB | 1 year ago

VMware Greenplum 7 Documentation
… use_container_logging. The default is no logging. The PL/Container log information is the information from the UDF that is run in the Docker container. By default, the PL/Container log information is sent to a system … troubleshooting a PL/Container UDF, you can change the Greenplum Database log level with the SET command. You can set the parameter in the session before you run your PL/Container UDF. This example sets the … performance even if a PL/Container UDF is not running. … Python and R call stack information is not displayed when debugging a UDF. The plpy.execute() methods …
2221 pages | 14.19 MB | 1 year ago

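Greenplum Database 管理员指南 6.2.1 (Greenplum Database Administrator Guide 6.2.1)
… similar. If a query uses an external component (such as PL/Container), the query has two parts: the query itself runs in the GP database, while the UDF runs in the PL/Container container. The query running in the GP database is managed by the ROLE's resource group, and the UDF running in PL/Container is managed by the PL/Container resource group; the latter is not reflected in the pg_stat_activity view, and the database cannot obtain … the cost becomes ever more pronounced; using the replicated distribution policy for large fact tables should be strictly forbidden! Main use cases for replicated tables: Replicated tables are not subject to the restriction that a UDF cannot run queries on an Instance. Previously, a UDF that accessed a business table was not allowed to run on an Instance; if it accesses a replicated table, the UDF may run read-only queries against that table on the Instance, although operations that modify data are still not allowed. For tables that must be broadcast when joined with other tables … Version 6 removed the Filespace concept, so the pg_filespace and pg_filespace_entry system tables were removed as well. For DDL backups, if a UDF references a custom type or another UDF whose definition appears later in the DDL backup file, restoring that UDF fails; running the DDL restore several times may resolve this kind of dependency problem. In version 4.3, readable external tables supported INTO error_table, a feature that since version 5 has been …
416 pages | 6.08 MB | 1 year ago
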
VMware Greenplum v6.18 Documentation
… can fail due to lack of dependency checking in Greenplum 4 catalog tables. For example, restoring a UDF can fail if it references a custom data type that is created later in the backup file. The INTO error_table … language is installed with Greenplum Database. To create and run a PL/Python user-defined function (UDF) in a database, you must register the PL/Python language with the database. Enabling PL/Python Support … real, double: float; numeric: decimal; other primitive types: string; SQL null value: None. (1) When the UDF return type is boolean, the Greenplum Database evaluates the return value for truth according to Python …
1959 pages | 19.73 MB | 1 year ago

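The last snippet above says a boolean-returning PL/Python UDF has its return value evaluated for truth in the Python sense. As a rough illustration only, the plain-Python sketch below shows what that implies, assuming ordinary Python truthiness and the "SQL null value: None" mapping from the same snippet. The helper name as_sql_boolean is hypothetical and is not part of Greenplum or PL/Python.

```python
# Hypothetical helper, not a Greenplum or PL/Python API: it mimics how the
# return value of a boolean-returning UDF would be read if ordinary Python
# truthiness applies (an assumption based on the snippet above).
def as_sql_boolean(value):
    if value is None:
        return None        # Python None corresponds to SQL NULL
    return bool(value)     # everything else follows Python truth rules

# Note that the non-empty string "f" is truthy in Python.
samples = [0, 1, "", "f", [], [0], None]
results = [as_sql_boolean(v) for v in samples]

for sample, result in zip(samples, results):
    print(repr(sample), "->", result)
```

Under that assumption, a UDF that returns the string "f" would be read as true, so returning an actual Python bool is the less surprising choice.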













