This article introduces the installation and deployment steps of Greenplum. First, install Greenplum, then configure parameters, and then install the postgis extension.
1, Install and configure GreenPlum
1. Modify the corresponding machine name corresponding to / etc/hosts, which can be viewed through hostname (all nodes)
172.16.1.81 master
172.16.1.82 slave1
2. Create user gpadmin (all nodes)
Passwd gpadmin
3. Modify machine kernel parameters
(the master node is distributed to other nodes) (Note: the same configuration is first configured on the master node, and then copied to other nodes)
Modify the / etc/sysctl.conf file and add the following:
kernel.shmmax = 500000000 kernel.shmmni = 4096 kernel.shmall = 4000000000 kernel.sem = 250 512000 100 2048 kernel.sysrq = 1 kernel.core_uses_pid = 1 kernel.msgmnb = 65536 kernel.msgmax = 65536 kernel.msgmni = 2048 net.ipv4.tcp_syncookies = 1 net.ipv4.ip_forward = 0 net.ipv4.conf.default.accept_source_route = 0 net.ipv4.tcp_tw_recycle = 1 net.ipv4.tcp_max_syn_backlog = 4096 net.ipv4.conf.all.arp_filter = 1 net.ipv4.ip_local_port_range = 1025 65535 net.core.netdev_max_backlog = 10000 net.core.rmem_max = 2097152 net.core.wmem_max = 2097152 vm.overcommit_memory = 2 Execute the command to validate the parameter: sysctl –p
4. Modify file opening limit (all nodes)
Modify the / etc/security/limits.conf file and add the following:
* soft nofile 65536 * hard nofile 65536 * soft nproc 131072 * hard nproc 131072
5. Install on the greenplus master node
Two versions can be downloaded from the official website, a binary version and an rpm version
unzip greenplum-db-5.8.0-rhel7-x86_64.zip
Start installation execution
./greenplum-db-5.8.0-rhel7-x86_64.bin
Our installation path is / home / gpadmin / greenplus / greenplus-5.16.0
ll /home/gpadmin/greenplum/
Successfully installed on the master node
6. Start installing other nodes
On the master node
Cd /home/gpadmin/greenplum/ Mkdir conf
Create a new file hostlist and add the following:
Master Slave1
Also create a new seg_hosts file and add the following:
Slave1
Execute command
Su gpadminsource /home/gpadmin/greenplum/greenplum-db/greenplum_path.sh gpssh-exkeys -f /home/gpadmin/greenplum/conf/hostlist
The test shown in the figure represents success.
7. Distribute the installation package to the gpadmin user of other nodes for execution
tar -cf gp.5.16.0.tar greenplum-5.16.0/ pack gpscp -f /home/gpadmin/greenplum//conf/seg_hosts gp.5.16.0.tar =:/home/gpadmin/greenplum/
Enter the / home / gpadmin / greenplus / conf directory and execute the command
gpssh -f seg_hosts
cd /home/gpadmin/greenplum [slave1] tar –xf gp.5.16.0.tar ln -s ./greenplum-db-5.16.0 greenplum-db [slave1] exit
8. Initialize the database
/Create in the directory home / gpadmin / greenplus /
mkdir /data
Modify permissions
chown gpadmin.gpadmin /data
Enter the / home / gpadmin / greenplus / conf directory
gpssh -f hostlist cd /home/gpadmin/greenplum/data mkdir –p /home/gpadmin/greenplum/data/master mkdir –p /home/gpadmin/greenplum/data/primary mkdir –p /home/gpadmin/greenplum/data/mirror
Configure environment variables in the master node
vi /home/gpadmin/.bashrc
Add at the end
9. Modify initialization configuration file
Initialize the Greenplum configuration file template in / usr / local / Greenplum dB / docs / cli_ Under the help / gpconfigs directory, gpinitsystem_config is the template for initializing Greenplum. The configuration of Mirror Segment in this template is annotated; Create a copy and modify its following configuration:
cd /home/gpadmin/greenplum/greenplum-db/docs/cli_help/gpconfigs cp gpinitsystem_config initgp_config vi initgp_config
The modified content is that 4-8 data nodes are recommended, and 4 are installed in the system
declare -a DATA_DIRECTORY=(/home/gpadmin/greenplum/data/primary /home/gpadmin/greenplum/data/primary /home/gpadmin/greenplum/data/primary /home/gpadmin/greenplum/data/primary )
MASTER_HOSTNAME=master #Master node name MASTER_DIRECTORY=/home/gpadmin/greenplum/data/master MASTER_DATA_DIRECTORY=/home/gpadmin/greenplum/data/gpseg-1 DATABASE_NAME=gp_sydb #Consistent with the configured initialization database name MACHINE_LIST_FILE=(/home/gpadmin/greenplum /conf/seg_hosts
Perform initialization
gpinitsystem -c initgp_config –S
If the initialization is successful, the installation is successful
2, Install the postgis plug-in and download the gpkg file from the official website
1. Perform the same as the installation steps.
2. Modify the configuration file
Vi hosts_expand Slave2 Slave3
3. Create myexpand database
4. Execute orders
gpexpand -D myexpand
5. Check whether the test is successful:
select a.dbid,a.content,a.role,a.port,a.hostname,b.fsname,c.fselocation from gp_segment_configuration a ,pg_filespace b,pg_filespace_entry c where a.dbid=c.fsedbid and b.oid=c.fsefsoid order by content;
3, Start and log in to the database
su gpadmin gpstop -M fast gpstart -a psql -h 172.16.1.81 -p 5432 -d yourDB -U gpadmin --Enter the password according to the prompt, and the login is successful
4, Common commands for GreenPlum operation
\c --Connect and switch databases(The database name is only lowercase, but postgres Is case sensitive) \dt --List all tables \l --List all databases SELECT block_id,block_name FROM t_aoi_110000 CROSS JOIN ST_MakeEnvelope(116.343242,39.868524,116.362350,39.872238,4326) As geom_src WHERE ST_Intersects(geom,geom_src);
Scan the code to join my knowledge Planet: "spatio-temporal big data", you can obtain more and higher quality knowledge and documents in maps, geographic information, GIS, big data, spatio-temporal analysis, real 3D, map visualization, etc. the top 20 can enjoy a 50% discount coupon.