VMware Greenplum 7 Documentation
14.19 MB
2221 pages
Language | Format | Rating |
---|---|---|
English | .pdf | 3 |
Abstract | ||
The document provides guidance on migrating partition maintenance scripts from Greenplum 6 to Greenplum 7, detailing changes in partitioning-related system catalogs, views, and functions. It outlines the removal of certain catalog tables in Greenplum 7 and introduces new ones. The document also covers installation procedures, system configuration, and performance optimization techniques for Greenplum Database. It highlights key differences between Greenplum 6 and 7, including support for new features and extensions, and emphasizes the importance of proper database design and data distribution for optimal performance. | ||
AI Summary | ||
### VMware Greenplum 7 Documentation Summary
1. **Partition Maintenance Script Migration**
- **Removed Catalogs, Views, and Functions in Greenplum 7**: `pg_partition`, `pg_partition_columns`, `pg_partition_encoding`, `pg_partition_rule`, `pg_partition_templates`, `pg_partitions`, `pg_stat_partition_operations`, `pg_partition_def()`.
- **New Catalogs and Functions**: `gp_partition_template`, `pg_partitioned_table`, `pg_partition_tree()`, `pg_partition_ancestors()`, `pg_partition_root()`.
- **Migration Required**: Changes in partitioning-related system catalogs and functions necessitate updates to existing partition maintenance scripts.
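As a first step in such a migration, an existing maintenance script can be checked for references to the removed names. The following is a minimal sketch, not a VMware-provided tool: the helper `find_removed_references` and the constant names are hypothetical, and the query string assumes a psycopg-style `%s` placeholder. The name list is taken from the summary above, and `pg_partition_tree()` is used with the column names it has in PostgreSQL 12 (`relid`, `parentrelid`, `isleaf`, `level`).

```python
import re

# Names the summary lists as removed in Greenplum 7.
REMOVED_IN_GP7 = [
    "pg_partition",
    "pg_partition_columns",
    "pg_partition_encoding",
    "pg_partition_rule",
    "pg_partition_templates",
    "pg_partitions",
    "pg_stat_partition_operations",
    "pg_partition_def",
]

# Match whole identifiers only, so that pg_partition does not fire on
# pg_partition_tree or pg_partitioned_table, which still exist.
_PATTERN = re.compile(
    r"(?<![A-Za-z0-9_])("
    + "|".join(sorted(REMOVED_IN_GP7, key=len, reverse=True))
    + r")(?![A-Za-z0-9_])",
    re.IGNORECASE,
)


def find_removed_references(script_text):
    """Return (line_number, name) pairs for each removed name in the script."""
    hits = []
    for lineno, line in enumerate(script_text.splitlines(), start=1):
        for match in _PATTERN.finditer(line):
            hits.append((lineno, match.group(1).lower()))
    return hits


# One possible replacement for listing the leaf partitions of a table.
# The %s placeholder is an assumption about the database driver in use.
LEAF_PARTITIONS_SQL = """
SELECT relid, parentrelid, level
FROM pg_partition_tree(%s::regclass)
WHERE isleaf;
"""
```

Running `find_removed_references` over each script gives a list of lines to rewrite against the new catalogs and functions. It only detects names; it does not translate the queries.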
2. **Hardware and Network Configuration**
- **Supported Platforms**: Greenplum 7 supports reference hardware platforms and operating systems certified by VMware.
- **RAID and Disk Configuration**: Logical disks using RAID arrays are recommended for performance and capacity optimization.
- **Interconnect**:
  - Uses a standard Ethernet switching fabric with 10-Gigabit or faster networks.
  - Default protocol: UDPIFC (high scalability, no 1000-segment limit).
  - Optional TCP mode (scalability limit of 1000 segments).
  - Network redundancy: Dual 10Gbps switches and redundant connections recommended.
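The interconnect trade-off above can be expressed as a small pre-flight check. This is a sketch under stated assumptions: the function name is hypothetical, and treating exactly 1000 segments as still within the TCP limit is an assumption, since the summary gives only the figure. In Greenplum the mode itself is selected with the `gp_interconnect_type` server configuration parameter.

```python
# Scalability limit for TCP mode quoted in the summary.
TCP_SEGMENT_LIMIT = 1000


def allowed_interconnect_types(segment_count):
    """Return the interconnect modes that fit a cluster of the given size."""
    if segment_count < 1:
        raise ValueError("segment_count must be positive")
    modes = ["udpifc"]  # default protocol; no 1000-segment limit
    # Assumption: the limit is inclusive, so 1000 segments may still use TCP.
    if segment_count <= TCP_SEGMENT_LIMIT:
        modes.append("tcp")
    return modes
```

For example, `allowed_interconnect_types(1500)` leaves only the default UDPIFC mode, while a 64-segment cluster may use either.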
3. **Database Design and Optimization**
- **Data Distribution**: Ensure even data distribution across segments for optimal performance.
- **Distribution Key**: Choose a key that allows parallel query execution.
- **Data Types**: Use the smallest possible data types to reduce storage and improve performance.
- **Indexes and Constraints**: Verify index usage and optimize table constraints.
- **Statistics**: Use `ANALYZE` to gather statistics for better query planning.
- **Maximum Limits**: Greenplum supports unlimited database and table sizes, with row and column limits as specified.
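Whether a distribution key spreads rows evenly can be checked from per-segment row counts, which a query such as `SELECT gp_segment_id, count(*) FROM my_table GROUP BY gp_segment_id;` returns (`my_table` is a placeholder). The sketch below is illustrative only: `skew_report` is a hypothetical helper, and the skew measure it uses (how far the fullest segment exceeds the mean) is one simple choice, not a metric defined by the Greenplum documentation.

```python
def skew_report(rows_per_segment):
    """Summarise how evenly rows are spread across segments.

    rows_per_segment maps a segment id to the number of rows stored there.
    """
    counts = list(rows_per_segment.values())
    if not counts:
        raise ValueError("no segments supplied")
    total = sum(counts)
    mean = total / len(counts)
    largest = max(counts)
    # Fraction by which the fullest segment exceeds the average;
    # 0.0 means the rows are perfectly even.
    skew = (largest - mean) / mean if mean else 0.0
    return {
        "total_rows": total,
        "mean_rows": mean,
        "max_rows": largest,
        "skew": skew,
    }
```

A skew close to 0 suggests the key distributes well. A large value means one segment holds far more rows than the others and will tend to finish its share of a parallel query last.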
4. **Installation Guide**
- **Preparation**: Configure systems, create the `gpadmin` user, and install dependencies.
- **Package Installation**: Use `yum` or `apt` to install pre-built RPM/DEB packages.
- **Environment Setup**: Set Greenplum environment variables after installation.
- **Initialization**: Use `gpinitsystem` to create the database cluster, and configure timezone and localization settings.
5. **Client Tools and Extensions**
- **Supported Platforms**: Windows 7/10, Windows Server 2012+, RHEL/Oracle Linux/Rocky Linux 8/9.
- **Installation**: Download and install client tools from VMware Tanzu Network.
- **Features**: Includes utilities for data loading, query execution, and extensions for Python, R, and machine learning.
6. **Performance Tuning and Monitoring**
- **Statistics**: Use `ANALYZE` to gather column statistics.
- **Distribution Key**: Optimize for even data distribution.
- **Memory and Disk I/O**: Validate disk performance and memory bandwidth.
- **Query Optimization**: Review query plans and adjust statistics as needed.
7. **Data Science and Machine Learning**
- **Extensions**: Supports Python and R packages for data science.
- **PXF (Platform Extension Framework)**: Enables integration with external data sources (e.g., HDFS, S3).
8. **Greenplum 6 to 7 Differences**
- **Naming Changes**: Updated function and catalog names.
- **Deprecated Features**: Some features removed or modified.
- **Behavioral Changes**: Changes in system catalog behavior and interconnect defaults.
9. **Next Steps**
- Allow client connections, create databases, and load data.
- Install optional extensions and configure timezone/localization settings.
- Review important changes and upgrade instructions for Greenplum 6 users.
This summary provides a concise overview of the key features, configurations, and updates in VMware Greenplum 7, focusing on essential information for administrators and developers. |