湾区同学技术沙龙
(Bay Area) Transwarp(星环科技) && DistributedLog
17 March 2017
6:30 PM – 9:30 PM, 3/17/2017, Friday
Registration
- Registration link: tech-meetup-03-17-2017.eventbrite.com/
- Event link: (Bay Area) Transwarp(星环科技) && DistributedLog
Event Info
- Language: Chinese
- Time: 6:30 PM – 9:30 PM, 3/17/2017, Friday
- Location: 1059 E Meadow Cl, Room A, Palo Alto, CA 94043
Agenda
- 6:30pm - 8:00pm: Key Innovations and Usage Scenarios of Transwarp Data Hub
- 8:00pm - 9:00pm: Building reliable real-time services with Apache DistributedLog
- 9:00pm - 9:30pm: Offline Networking
Talk 1: Key Innovations and Usage Scenarios of Transwarp Data Hub (Transwarp)
Big data technology is rapidly changing the IT industry in China. Among those technologies, Hadoop is the best-known one and keeps growing its popularity. Transwarp Data Hub (shortened to TDH) has been one of the mainstream Hadoop distribution recognized by Gartner since 2016. As a one-stop big data platform, TDH has many new cutting-edge technologies which revolutionarily improve the usability, performance and stability so that enterprise can build core business system and create new applications in a more cost-saving and effort-saving manner. This topic will talk about the technical innovations of TDH, and also brief introduce the usage scenarios for those technique.
The topic will cover:
- The SQL engine with full SQL compatibility and extreme performance
- A in-memory or SSD acclerated columnar store
- Unified event-driven and batch processing engine for streaming computation
- Business scenarios for those technique
Talk 2: Building reliable real-time services with Apache DistributedLog (Sijie, Twitter)
Apache DistributedLog (incubating) is a low-latency, high-throughput replicated log service. Sijie Guo shares how Twitter has used DistributedLog as the real-time data foundation in production for years, supporting services like distributed databases, pub-sub messaging, and real-time stream computing and delivering more than 1.5 trillion (17 PB) events per day.
Topics include:
- Overview of Apache DistributedLog
- How Twitter uses DistributedLog to build strong consistency in its databases
- How Twitter uses DistributedLog for data replication across regions
- How Twitter uses DistributedLog for pub-sub and real-time stream computing
- Lessons learned from production
Speaker Bio
- Yuanhao Sun, Founder & CTO of Transwarp
Yuanhao Sun graduated from Nanjing University majored in Computer Science and joined Intel in 2003. Before starting Transwarp, he worked as CTO of Data Center Software Division of Intel Asia-Pacific Research and Development Center in charge of the development and commercialization of Intel's enterprise Hadoop.
Yuanhao left Intel in 2013 to start Transwarp which focus on high-efficiency computing engine and data analysis algorithms on Apache Hadoop. Yuanhao was the founder of Intel's enterprise Hadoop. He led the team to make multiple improvements and enhancements to meet different industry’s needs. Yuanhao and his team made great contribution to big data development in China with hundreds of successful Big Data use cases.
- Wanggen Liu, Director of Transwarp
Wanggen is the R&D director of Transwarp, and responsible for the engineering and production of TDH. He joined Transwarp in 2013 and led the effort of bringing SQL and transactions into TDH; before that he was a chip architecture in NVIDIA and focused on GPU SM performance. He has rich engineering experience in distributed computation, compiler technology and computer architecture.
- Sijie Guo, Messaging Group TL of Twitter
Sijie (github.com/sijie) is the tech lead of Twitter’s Messaging group, and he is the cocreator of Apache DistributedLog and the PMC chair of Apache BookKeeper as well.
About Transwarp:
Transwarp Technology (Shanghai) Co., Ltd is one of the very few high technology companies in China that have truly mastered essential big data technologies. Transwarp specializes in R&D and service of enterprise big data infrastructure software products which has been positioned furthest for its completeness of vision in the visionaries' quadrant in the Gartner 2016 Magic Quadrant for Data Warehouse and Data Management Solutions for Analytics, also the only vendor from PRC. Transwarp is famous as one of the six mainstream Hadoop Distribution Vendors worldwide. Transwarp products include big data integrated platform ‘Transwarp Data Hub (abbrv. TDH)’, hyper-converged big data appliance ‘TxData Appliance’ and cloud OS ‘Transwarp OS’. So far, TDH has been successfully deployed in China in industries including telecommunications, finance, transportation, utilities as well as public sectors, making Transwarp the big data infrastructure vendor with the most deployments in China.
主办
协办
- 中国科技大学校友会创业俱乐部
- 南京大学硅谷校友会
- 中国科大硅谷校友会
- 北加州清华校友会
- 瀚海硅谷科技园
- 硅谷清华联网
- 浙江大学校友会海纳创新创业俱乐部
- 北京大学北加州校友会
- 武汉大学北加州校友会
- 东南大学硅谷校友会
- 吉林大学硅谷校友会
- 复旦大学北加州校友会
- 华人事业互助会
- 北加州华中科技大学校友会
- 北京航空航天大学硅谷校友会
- 北京邮电大学北美校友会
- 上海交通大学硅谷校友会
- 兰州大学北加州校友会
- 电子科技大学硅谷校友会
- 旧金山湾区南开校友会
- 硅谷天津大学校友会
- 北京理工大学硅谷校友会
- 安徽大学北美校友会
- 湖南大学北美校友会
- 湘潭大学北美校友会
- 哈工大硅谷校友会
- 中山大学海外校友联网
- 长城会 RobotX Space
- 硅谷商学院
- 斯坦福中国学生学者联谊会(ACSSS)
Related articles
- (Bay Area) Snowflake / Databricks / OceanBase
- (Bay Area) 云端数据中台:数据编排与平台运维
- (Bay Area) Google Doc 是如何炼成的 - 深入浅出协同编辑/Deep Dive Collaborative Editing
- (Bay Area) An introduction of Analytics Zoo and how to use it at Uber
- (Bay Area) Tensorflow.JS: Bringing Machine Learning To The Web And Beyond
- (Bay Area) Weakly Supervised Natural Language Understanding / 基于弱监督学习的自然语言理解 By Mosaix.ai
- (Bay Area) Data Extraction Revolution in Bloomberg, From Human Typing To Deep Learning Excerpting
- (Bay Area) Next-Generation AI Powered Operation System
- (Bay Area) Power Blockchain with Hardware Innovations
- (Bay Area) 区块链产业现状及技术发展(阿里巴巴技术日)
- (Bay Area) Anatomizing Blockchain through Many Views(区块链折叠)
- (Bay Area) Deep Dive of Alluxio and Google gVisor
- (Bay Area) 技术创造新商业:阿里巴巴搜索推荐&计算平台事业部硅谷开放日
- (Bay Area) Google Translate助力自然语言理解
- (Bay Area) Alibaba Tech Open Day – AI, Cloud, Infrastructure and More
- (Bay Area) 通向区块链3.0的未来之路
- (Bay Area) Alibaba New Retail / Hema Tech Day (盒马生鲜技术日)
- (Bay Area) exGoogle Leaders, leap.ai co-founders share their career stories & insights (Richard Liu, Yunkai Zhou)
- (Bay Area) Augmented Intelligence to Improve Health Care Consumer Experience
- (Bay Area) GrowingIO 湾区技术同学见面会
- (Bay Area) Alibaba Technology Forum, Stanford University
- (Bay Area) How Pinterest Perfected New User Onboarding
- (Bay Area) Tencent Tech Day - Silicon Valley
- (Bay Area) Deep dive of DeepMap (Wei Luo)
- (Bay Area) Apache Kafka: The Rise of Real-time
- (Bay Area) 苏宁机器学习平台及Buddy AI人工智能自动客服系统技术分享
- (Bay Area) JD.com Tech Day - Leverage Technology to empower business intelligence
- (Shanghai) 采用超低功耗AI技术的小MU机器人的实现与应用
- (Bay Area) AI in Service robotics and Mini Robot
- (Shanghai) Google SRE如何管理数据中心
- (Bay Area) 如何用1/6000的训练数据击败深度学习——文字识别实验讨论
- (Shanghai) Twitter Heron Streaming at Scale
- (Bay Area) AI大牛谈深度学习最新进展
- (Bay Area) 新一代创新搜索技术架构讨论专场
- (Bay Area) CAINIAO Technology Forum, Silicon Valley
- (Bay Area) How to build a NewSQL database? (Qi Liu)
- (Bay Area) The Evolution of Big Data APIs in Spark (Reynold Xin)
- (Bay Area) TensorFlow: A Large-Scale Machine Learning System (Zhifeng Chen)
- (Bay Area) Ant Financial Tech Forum (2016蚂蚁金服技术湾区论坛)
- (Bay Area) Espresso: LinkedIn’s Distributed Database (Yun Sun)
- (Bay Area) Virtual Reality & Augmented Reality (Guodong Rong)
- (Bay Area) Etcd: A key-value store Open Source for Data consistency, Data persistency, Data synchronization in Distributed system (Xiang Li)
- (Bay Area) Introduction To OpenStack (Weidong Shao & Xin Wu)
- (Bay Area) A Journey of AI: from Silicon Valley to Beijing, from Big Name to Startup (Kai Yu)
- (Bay Area) CoreOS rkt, a Container Runtime (Yifan Gu)
- (Bay Area) Borg: Large-scale Cluster Management at Google (Xiao Zhang)
- (Bay Area) Spark MLlib: Past, Present and Future (Xiangrui Meng)
- (Bay Area) Cassandra: an open source distributed database (Charles Cao)
- (Bay Area) Tachyon: an open source memory-centric distributed storage system (Bin Fan / Shaoshan Liu / Haoyuan Li)
- (Bay Area) Apache Samza: a distributed stream processing framework (Yi Pan)
- (Bay Area) 大数据时代的金融服务创新 (Li Cheng)
- (Bay Area) 大数据人工智能 (Kai Yu)
- (Bay Area) Photon: Fault-tolerant and scalable joining of continuous data streams (Tianhao Qiu)
- (Bay Area) Large-scale data science and engineering with Spark (Reynold Xin)
- (Bay Area) Building a real time data platform with Apache Kafka (Jun Rao)
- (Bay Area) Kubernetes: Google’s secret weapon for Cloud computing (Dawn Chen)
- (Bay Area) Tachyon: A Reliable Memory-Centric Distributed Storage System