pdf文档 VMware Greenplum 7 Documentation 推荐

14.19 MB 2221 页 0 评论
语言 格式 评分
英语
.pdf
3
摘要
The document provides guidance on migrating partition maintenance scripts from Greenplum 6 to Greenplum 7, detailing changes in partitioning-related system catalogs, views, and functions. It outlines the removal of certain catalog tables in Greenplum 7 and introduces new ones. The document also covers installation procedures, system configuration, and performance optimization techniques for Greenplum Database. It highlights key differences between Greenplum 6 and 7, including support for new features and extensions, and emphasizes the importance of proper database design and data distribution for optimal performance.
AI总结
### VMware Greenplum 7 Documentation Summary 1. **Partition Maintenance Script Migration** - **Removed Catalogs in Greenplum 7**: `pg_partition`, `pg_partition_columns`, `pg_partition_encoding`, `pg_partition_rule`, `pg_partition_templates`, `pg_partitions`, `pg_stat_partition_operations`, `pg_partition_def()`. - **New Catalogs and Functions**: `gp_partition_template`, `pg_partitioned_table`, `pg_partition_tree()`, `pg_partition_ancestors()`, `pg_partition_root()`. - **Migration Required**: Changes in partitioning-related system catalogs and functions necessitate updates to existing partition maintenance scripts. 2. **Hardware and Network Configuration** - **Supported Platforms**: Greenplum 7 supports reference hardware platforms and operating systems certified by VMware. - **RAID and Disk Configuration**: Logical disks using RAID arrays are recommended for performance and capacity optimization. - **Interconnect**: - Uses a standard Ethernet switching fabric with 10-Gigabit or faster networks. - Default protocol: UDPIFC (high scalability, no 1000-segment limit). - Optional TCP mode (scalability limit of 1000 segments). - Network redundancy: Dual 10Gbps switches and redundant connections recommended. 3. **Database Design and Optimization** - **Partitioning**: Ensure even data distribution across segments for optimal performance. - **Distribution Key**: Choose a key that allows parallel query execution. - **Data Types**: Use the smallest possible data types to reduce storage and improve performance. - **Indexes and Constraints**: Verify index usage and optimize table constraints. - **Statistics**: Use `ANALYZE` to gather statistics for better query planning. - **Maximum Limits**: Greenplum supports unlimited database and table sizes, with row and column limits as specified. 4. **Installation Guide** - **Preparation**: Configure systems, create the `gpadmin` user, and install dependencies. - **Package Installation**: Use `yum` or `apt` to install pre-built RPM/DEB packages. - **Environment Setup**: Set Greenplum environment variables after installation. - **Initialization**: Use `gpinitsystem` to create the database cluster, and configure timezone and localization settings. 5. **Client Tools and Extensions** - **Supported Platforms**: Windows 7/10, Windows Server 2012+, RHEL/Oracle Linux/ Rocky Linux 8/9. - **Installation**: Download and install client tools from VMware Tanzu Network. - **Features**: Includes utilities for data loading, query execution, and extensions for Python, R, and machine learning. 6. **Performance Tuning and Monitoring** - **Statistics**: Use `ANALYZE` to gather column statistics. - **Distribution Key**: Optimize for even data distribution. - **Memory and Disk I/O**: Validate disk performance and memory bandwidth. - **Query Optimization**: Review query plans and adjust statistics as needed. 7. **Data Science and Machine Learning** - **Extensions**: Supports Python and R packages for data science. - **PXF (Platform Extension Framework)**: Enables integration with external data sources (e.g., Kafka, S3). 8. **Greenplum 6 to 7 Differences** - **Naming Changes**: Updated function and catalog names. - **Deprecated Features**: Some features removed or modified. - **Behavioral Changes**: Changes in system catalog behavior and interconnect defaults. 9. **Next Steps** - Allow client connections, create databases, and load data. - Install optional extensions and configure timezone/localization settings. - Review important changes and upgrade instructions for Greenplum 6 users. This summary provides a concise overview of the key features, configurations, and updates in VMware Greenplum 7, focusing on essential information for administrators and developers.
P1
P2
P3
P4
P5
P6
P7
P8
P9
P10
P11
P12
下载文档到本地,方便使用
- 可预览页数已用完,剩余 2209 页请下载阅读 -
文档评分
请文明评论,理性发言.