■Hadoop Conference Japan 2014 講演資料
https://hcj2014.eventbrite.com/
『A Deeper Understanding of Spark Internals』
Patrick Wendell (Databricks)
The document outlines a step-by-step guide for creating a Fast DDS (Data Distribution Service) pub/sub application using Visual Studio and command line tools. It covers installation, type definition, code generation, configuration, building, and execution processes for a HelloWorld example. The reader is instructed to perform specific commands and setups for both publisher and subscriber functionalities.
Software design as a cooperative game with EventStormingAlberto Brandolini
The document discusses the practice of software design through collaborative techniques like event storming to foster consensus and understanding among stakeholders. It emphasizes the importance of visualizing the entire process, engaging diverse experts, and addressing both policies and values to enhance team cooperation towards common goals. The document also highlights challenges in collaboration and strategies to navigate them for effective outcomes.
The document outlines a step-by-step guide for creating a Fast DDS (Data Distribution Service) pub/sub application using Visual Studio and command line tools. It covers installation, type definition, code generation, configuration, building, and execution processes for a HelloWorld example. The reader is instructed to perform specific commands and setups for both publisher and subscriber functionalities.
Software design as a cooperative game with EventStormingAlberto Brandolini
The document discusses the practice of software design through collaborative techniques like event storming to foster consensus and understanding among stakeholders. It emphasizes the importance of visualizing the entire process, engaging diverse experts, and addressing both policies and values to enhance team cooperation towards common goals. The document also highlights challenges in collaboration and strategies to navigate them for effective outcomes.
10 Reasons to Start Your Analytics Project with PostgreSQLSatoshi Nagayasu
PostgreSQL provides several advantages for analytics projects:
1) It allows connecting to external data sources and performing analytics queries across different data stores using features like foreign data wrappers.
2) Features like materialized views, transactional DDLs, and rich SQL capabilities help build effective data warehouses and data marts for analytics.
3) Performance optimizations like table partitioning, BRIN indexes, and parallel queries enable PostgreSQL to handle large datasets and complex queries efficiently.
PgDay Asia 2016 was the first pan-Asia PostgreSQL conference held in Singapore in March 2016. It was organized alongside FOSSASIA 2016 and had over 19 speakers from 9 countries give talks on PostgreSQL topics to an audience of around 100 attendees. Sessions included keynotes on the past, present and future of PostgreSQL and talks on performance, scaling, migration from Oracle, and using TABLESAMPLE for big data analysis. PgDay Asia 2017 was announced to take place again at FOSSASIA 2017 in the same region with an aim for an even better and larger event.
A Story Behind the Conference, or How pgDay Asia was bornSatoshi Nagayasu
This document describes the origins of the pgDay Asia PostgreSQL conference. It explains that Satoshi Nagayasu, chair of the Japan PostgreSQL Users Group, wanted to introduce PostgreSQL to Asian audiences and involve Asian communities in the global PostgreSQL community. He began traveling around Asia to meet people and gauge interest. With help from others in Asia and at FOSSASIA, the first pgDay Asia was held in 2016 as a joint event with FOSSASIA in Singapore. The document expresses gratitude to all who helped make the inaugural Asian PostgreSQL conference a reality.
PostgreSQL 9.4, 9.5 and Beyond @ COSCUP 2015 TaipeiSatoshi Nagayasu
The document provides an overview of new features in PostgreSQL versions 9.4 and 9.5, including improvements to NoSQL support with JSONB and GIN indexes, analytics functions like aggregation and materialized views, SQL features like UPSERT, security with row level access policies, replication capabilities using logical decoding, and infrastructure to support parallelization. It also outlines the status and changes between versions, and resources for using and learning about PostgreSQL.
Satoshi Nagayasu from the Japan PostgreSQL Users Group is proposing a pgDay Asia event to be held jointly with FOSSASIA 2016 in Singapore from March 18-20. FOSSASIA 2015 had over 900 attendees from 27 countries for its 3-day conference. The proposed pgDay Asia would occupy 1 day of FOSSASIA 2016 with 2 tracks (to be determined) focused on PostgreSQL in Asia. This is still a work in progress but attendees are asked to mark their calendars now for the joint event.
PostgreSQL 9.4 and Beyond @ FOSSASIA 2015 SingaporeSatoshi Nagayasu
This document summarizes Satoshi Nagayasu's presentation on PostgreSQL 9.4 and beyond. Key highlights include: improvements to JSON support with the new JSONB data type and GIN indexes; new aggregation functions and materialized views for analytics; logical decoding for more flexible replication; and infrastructure changes like dynamic background workers and shared memory to support parallelization. Upcoming features like BRIN indexes are also mentioned.
This document summarizes a presentation about new features in PostgreSQL 9.4. It discusses enhancements for NoSQL support with JSON and GIN indexes, analytics with new aggregate functions and materialized views, increased flexibility with logical replication, easier administration using ALTER SYSTEM, and improved infrastructure for parallelization through dynamic background workers and shared memory. The presentation provides an overview of the status of 9.4 and highlights some of the major new categories of features.
The document discusses the history and current state of the Japan PostgreSQL Users Group (JPUG), a non-profit organization that promotes PostgreSQL usage in Japan. It describes how JPUG was founded 15 years ago after an active mailing list emerged in the late 1990s. Currently, JPUG focuses on providing PostgreSQL information, educating users through meetups, and connecting the community. It has over 50 members and 9 regional branches that hold conferences and events to advance PostgreSQL in Japan and connect with other Asian communities.
海外の技術カンファレンスに行こう! Let’s go tech conferences overseas!Satoshi Nagayasu
This document discusses the benefits of attending technical conferences overseas. It lists the speaker's experience attending conferences like LinuxWorld and pgcon in San Francisco, Ottawa, Shenzhen. The top three reasons provided to attend are: 1) To learn the latest technologies and trends, 2) To gain a global perspective and experience diversity, 3) To meet people and make connections. The document encourages attending at least one overseas conference per year, and mentions pgcon 2015 as an upcoming option.
13. WITH句
WITH clause
• そのクエリのための一時テーブルを定義する
• サブクエリ内で2回以上参照する場合には、パフ
ォーマンスが良くなる
• 「サブクエリのサブクエリ(の…)」をしなくて
済むので、クエリがシンプルになる
WITH foo AS (
SELECT ... FROM ... GROUP BY ...
)
SELECT ... FROM foo WHERE ...
UNION ALL
SELECT ... FROM foo WHERE ...;
https://www.postgresql.org/docs/9.5/static/queries-with.html
19. JSONデータ型
JSON data type
testdb=# select n_nationkey,n_name from nation where
n_nationkey = 12;
n_nationkey | n_name
-------------+---------------------------
12 | JAPAN
(1 row)
testdb=# select jsonb_build_object('n_nationkey', n_nationkey,
'n_name', n_name) from nation where n_nationkey = 12;
jsonb_build_object
------------------------------------------------------------
{"n_name": "JAPAN ", "n_nationkey": 12}
(1 row)
20. Operator Description
9.4
-> Get an element by key as a JSON object
->> Get an element by key as a text object
#> Get an element by path as a JSON object
#>> Get an element by path as a text object
<@, @> Evaluate whether a JSON object contains a key/value pair
? Evaluate whether a JSON object contains a key or a value
?| Evaluate whether a JSON object contains ANY of keys or values
?& Evaluate whether a JSON object contains ALL of keys or values
9.5
|| Insert or Update an element to a JSON object
- Delete an element by key from a JSON object
#- Delete an element by path from a JSON object
http://www.postgresql.org/docs/9.5/static/functions-json.html
21. JSONデータ型
JSON data type
• スキーマを定義しなくてもデータを収集できる
• “Schema-less”, “Schema on Read” あるいは
“Schema-later”.
• SQLからもアクセス可能
JSON
Data Type
Fluentd
pg-Json plugin
View
(Schema)
App
App
Fluentd
22. パフォーマンス
3 types of Join
Full text search (n-gram)
Table Partition
BRIN Index
Table Sample
Parallel Queries
41. ユーザ定義関数 by Python
UDF by Python
CREATE OR REPLACE FUNCTION dumpenv(OUT text, OUT text)
RETURNS SETOF record
AS $$
import os
for e in os.environ:
plpy.notice(str(e) + ": " + os.environ[e])
yield(e, os.environ[e])
$$ LANGUAGE plpythonu;
42. ユーザ定義関数 by Python
UDF by Python
CREATE OR REPLACE FUNCTION dumpenv(OUT text, OUT text)
RETURNS SETOF record
AS $$
import os
for e in os.environ:
plpy.notice(str(e) + ": " + os.environ[e])
yield(e, os.environ[e])
$$ LANGUAGE plpythonu;
testdb=# select * from dumpenv() order by 1 limit 10;
column1 | column2
--------------------+-----------------------
G_BROKEN_FILENAMES | 1
HISTCONTROL | ignoredups
HISTSIZE | 1000
HOME | /home/snaga
HOSTNAME | localhost.localdomain
LANG | ja_JP.UTF-8
LC_COLLATE | C
LC_CTYPE | C
LC_MESSAGES | C
LC_MONETARY | C
(10 rows)
52. ユースケース
Apache MADlib (Incubating) User Survey Results Oct 2016
http://madlib.incubator.apache.org/community-artifacts/Apache-MADlib-user-survey-results-Oct-2016.pdf
53. 機能
MADlib: Distributed In-Database Machine Learning for Fun and Profit
https://archive.fosdem.org/2016/schedule/event/hpc_bigdata_madlib/