Loading…
现场活动
9月26日至28日
了解更多注册参加

Sched应用程式允许您创建日程,但不能替代您的活动注册。您必须先注册KubeCon + CloudNativeCon + Open Source Summit China 2023 才能参加会议。如果您还未注册但希望加入我们,请前往活动注册页面购买注册。

请注意:此日程以中国标准时间(UTC +8)自动显示。若要查看您首选时区的日程,请从右侧顶部的"Timezone"下拉菜单选择首选时区。日程可能会有变动,并且会议席位按照先到先得的原则提供。

In-person
September 26-28
Learn More and Register to Attend

The Sched app allows you to build your schedule but is not a substitute for your event registration. You must be registered for KubeCon + CloudNativeCon + Open Source Summit China to participate in the sessions. If you have not registered but would like to join us, please go to the event registration page to purchase a registration.

Please note: This schedule is automatically displayed in China Standard Time (UTC +8). To see the schedule in your preferred timezone, please select from the drop-down menu to the right, above "Filter by Date." The schedule is subject to change and session seating is available on a first-come, first-served basis. 
Wednesday, September 27 • 2:45pm - 3:20pm
从零到无穷大:如何基于AI技术的对冲基金在Kubernetes上构建云原生AI平台 | From Zero to Infinity:How AI-Powered Hedge Fund Build Cloud-Native AI Platform on Kubernetes - Yang Che, Alibaba Cloud & Zhiyi Li, Metabit Trading

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Metabit Trading是一家使用人工智能技术的量化投资公司,他们在K8s上构建了他们的研究平台。然而,他们的计算平台经常面临突然的任务需求,需要从分布式存储系统中并发访问数据的0到数百个pods的扩展。由于这些系统的评级限制和速度较慢,它们显著影响了训练性能并限制了计算可扩展性。 为了解决这个问题,他们使用了Fluid和JuiceFS来构建一个弹性分布式缓存解决方案。在本次会议中,Metabit和Fluid的专家将讨论如何通过创建一个弹性缓存集群并使用Prometheus根据缓存使用行为特征制定策略,在生产环境中实现自动扩展,最终达到1000Gbps。他们还将介绍如何使用Fluid和CronHPA进行定时自动缩放,以平衡成本和性能,并评估扩展的性能和成本效益,并展示解决方案的演示。

Metabit Trading is an AI-powered quantitative investment firm that builds their research platform on K8s. However, their computing platform often faces sudden tasks requiring scaling from 0 to hundreds of pods for the concurrent access of data from distributed storage systems. As these systems are rating-limited and slow, they significantly hamper training performance and limit compute scalability. To tackle this, they used Fluid, a CNCF project, and JuiceFS to build an elastic distributed cache solution. In this session, experts from Metabit and Fluid will discuss how to achieve automatic scaling in production environments by creating an elastic cache cluster and using Prometheus to set up a strategy based on behavioral characteristics of cache usage, ultimately reaching 1000Gbps. They will also cover using Fluid with CronHPA for timed autoscaling to balance cost and performance while evaluating the performance and cost benefits of scaling, and present a demo showcasing the solution.

Speakers
avatar for Zhiyi Li

Zhiyi Li

software engineer, metabit trading 乾象投资
Zhiyi Li is a senior engineer at Metabit Trading AI Infra team. Her primary responsibility is to productionize research-related tools and platforms in Kubernetes. She is actively involved in implementing and managing cloud-native solutions that leverage Argo, Prometheus, Elasticsearch... Read More →
avatar for Yang Che

Yang Che

senior engineer, Alibaba Cloud
Yang Che, is a senior engineer of Alibaba Cloud. He works in Alibaba cloud container service team, and focuses on Kubernetes and container related product development. Yang also works on building elastic machine learning platform on those technologies. He is an active contributor... Read More →



Wednesday September 27, 2023 2:45pm - 3:20pm CST
2层 会议室 2 | 2F Room 2
  数据+处理+存储 | Data + Processing + Storage