Project background:
* Description: A-Ops is an original project initiated in the openEuler community. It aims to reduce the difficulty of operating large clusters, to make system O&M visual, automated, and intelligent, and to build highly reliable, high-performance, never-interrupted infrastructure.
* Recommended by:
  * Hu Feng (胡峰, @solarhu), openEuler TC member
  * Luo Shengwei (罗盛炜, @Lostwayzxc), openEuler sig-ops maintainer
  * Yang Zhao (杨昭, @yangzhao_kl), openEuler sig-CloudNative maintainer
* Repositories:
  * https://gitee.com/openeuler/A-Ops
  * https://gitee.com/openeuler/aops-apollo
  * https://gitee.com/openeuler/aops-diana
  * https://gitee.com/openeuler/aops-zeus
  * https://gitee.com/openeuler/gala-gopher
  * https://gitee.com/openeuler/gala-spider
  * https://gitee.com/openeuler/gala-anteater
  * https://gitee.com/openeuler/gala-ragdoll
  * https://gitee.com/openeuler/syscare
  * https://gitee.com/openeuler/X-diagnosis
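openEuler outstanding project evaluation criteria
* Recommended award: openEuler Outstanding Project of the Year
* Open source and openness: the project is released under the Mulan open source license, and its code is hosted on openEuler.
* Industry impact: apollo, gala-gopher, and gala-ragdoll have been validated at customer sites and are about to enter large-scale use.
* Technical innovation: taking the operating system as the observation point and building on low-overhead probe technology, the project provides end-to-end observability and hot patching, enabling rapid fault detection, assisted fault localization, and hot fixes for the system.
* Community activity: ranked 6th in community activity over the past year, with 107 core developers, 23 participating organizations, 6000+ merged PRs, and an average PR close time of 0.58 days.
* High-quality development and operation: the code follows the coding standards and is of high quality; the documentation is thorough and hosted on the official website. PR submissions are complete, the code review process is complete, and reviews are detailed in both scope and content. User feedback is answered promptly, and community meetings are held twice a month.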
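1. Software introduction
A-Ops is a fault O&M platform built at the operating system level. It provides an intelligent O&M solution that spans data collection, health inspection, fault diagnosis, and fault repair.
In recent years, with the adoption of cloud native, serverless, and similar technologies, operating cloud infrastructure has become increasingly complex. Sub-health problems in particular (intermittent, short-lived, varied in type, and wide in scope) pose a major challenge for cloud infrastructure fault diagnosis. The challenges of sub-health fault diagnosis, including observability, management of massive data volumes, and the generalization ability of AI algorithms, are especially pronounced in Linux scenarios. In the openEuler open source operating system, existing O&M tools are not sufficient to detect and locate sub-health problems in time. The gaps include: no online, continuous monitoring capability; no fine-grained observation from the application's perspective; and no automated, AI-based analysis built on full-stack observation data. The main difficulties in building sub-health fault diagnosis capability include: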
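* Full-stack, non-intrusive observability.
* Continuous, fine-grained, low-overhead monitoring.
* Anomaly detection and visualized fault inference that adapt to different application scenarios.
* Patch management and repair that are imperceptible to the workload.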
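* gala: uses non-intrusive observation based on eBPF and a Java agent, with intelligent assistance, to tackle sub-health faults.
* apollo: an intelligent patch management framework that provides real-time CVE/bug inspection and cold/hot patch repair, enabling automatic discovery and zero-interruption fixes.
* ragdoll: faults caused by configuration account for more than 50% of all OS problems. ragdoll provides system configuration monitoring, detects configuration changes in real time, and helps locate misconfigurations quickly.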
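
To make the idea of configuration change detection concrete, here is a minimal sketch of comparing a known-good baseline against the current settings. It is purely illustrative and is not ragdoll's implementation or API: the function names `parse_config` and `diff_config`, the simple `key = value` file format, and the sample keys are all assumptions made for this example.

```python
from typing import Dict, List, Optional, Tuple

# One reported change: (kind, key, old value, new value).
Change = Tuple[str, str, Optional[str], Optional[str]]


def parse_config(text: str) -> Dict[str, str]:
    """Parse 'key = value' lines, skipping blanks and '#' comments (hypothetical format)."""
    settings: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()
        if not line or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()] = value.strip()
    return settings


def diff_config(baseline: Dict[str, str], current: Dict[str, str]) -> List[Change]:
    """Return added, removed, and changed keys, sorted by key name."""
    changes: List[Change] = []
    for key in sorted(set(baseline) | set(current)):
        old, new = baseline.get(key), current.get(key)
        if old == new:
            continue
        if old is None:
            kind = "added"
        elif new is None:
            kind = "removed"
        else:
            kind = "changed"
        changes.append((kind, key, old, new))
    return changes


if __name__ == "__main__":
    baseline_text = "net.ipv4.ip_forward = 0\nvm.swappiness = 10\n# note\nkernel.panic = 5"
    current_text = "net.ipv4.ip_forward = 1\nvm.swappiness = 10\nfs.file-max = 100000"
    for change in diff_config(parse_config(baseline_text), parse_config(current_text)):
        print(change)
```

A real monitor would also need to watch files for modification and handle format-specific syntax; this sketch only covers the comparison step.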
3. Community milestones
* Making openEuler smarter with AI, making AI more efficient with openEuler
  https://mp.weixin.qq.com/s?__biz=MzI2NDE4OTE2Mg==&mid=2247506918&idx=1&sn=f8ec684dffeb1f595d65b933a2dcb219&chksm=eab2fe63ddc57775dba0deb61fee2699541b58ebfb269e0d9063b35c1332ea0a9bcf5c9fa2bf&sessionid=1696676381&scene=126&subscene=0&clicktime=1696735557&enterid=1696735557&ascene=3&fasttmpl_type=0&fasttmpl_fullversion=6875228-zh_CN-zip&fasttmpl_flag=0&realreporttime=1696735646747&devicetype=android-31&version=28002a3b&nettype=3gnet&abtest_cookie=AAACAA%3D%3D&lang=zh_CN&session_us=gh_db421e50e6cd&countrycode=CN&exportkey=n_ChQIAhIQgCqvkoCxfzxFiRKeT3bTkhLTAQIE97dBBAEAAAAAAOVxL0x%2FOsgAAAAOpnltbLcz9gKNyK89dVj0isHrUP7KR77fCxVz518ffYPMCRjlnc2QOTX5wzp1oy%2FhvBf855I6s4%2F8LpjOKvmhp1LlIUEHtyNycXRGGtm5YjI1URSvOnzjccRZ%2FKztJ%2B6clueLSI5sVOKOlhhi00AB4e5oAD4OFqAELUcrEsRDpDgKHYx7CDgFhorzXpC8ddjUO0G%2BRgWB1U%2BPbZZ77abWaYULU8B0iXA2itpgawHWnlrwtgXSEdIS9WsM4GI%3D&pass_ticket=gzAYhwxfUdLOluQ1yGQOH6cpE5mZfjLmNtv4pfPIa9cf5nBH8AJleQAkdf8H1UTZ&wx_header=3
* Hanyuan Technology (瀚元科技): using A-OPS intelligent O&M to raise edge server O&M efficiency by 30%
  https://mp.weixin.qq.com/s?__biz=MzI2NDE4OTE2Mg==&mid=2247505418&idx=1&sn=405f5fc5b032226ad0aae0fae6129bf1&chksm=eab2f38fddc57a99be881419c4a355d0bb45eebbc5d448f6e2fc1960cb946555de496f216fad&sessionid=1696676381&scene=126&subscene=0&clicktime=1696736246&enterid=1696736246&ascene=3&fasttmpl_type=0&fasttmpl_fullversion=6875228-zh_CN-zip&fasttmpl_flag=0&realreporttime=1696736246624&devicetype=android-31&version=28002a3b&nettype=3gnet&abtest_cookie=AAACAA%3D%3D&lang=zh_CN&session_us=gh_db421e50e6cd&countrycode=CN&exportkey=n_ChQIAhIQ0APFeK2MlwQSRL06vbvSrhLiAQIE97dBBAEAAAAAADO3FqFRensAAAAOpnltbLcz9gKNyK89dVj0MKCk924LEj69VzBpuiqzSz%2FaAw95hVt6D%2BYyUG5s0aD2nPCsHML4xd4VUhcAz4oOuNBL1o7DivJj0aCrSQzkEzR5TNmEEO6My7EoWVPMx55xSaRbqAHcm%2F%2Fb5VaiqUJDM7T0uFLLgHydui9H5SUc3kjHzkxvNowDAHy%2BYzJf0hatR3fFoIFrryl89Pts6OHeO%2B8NUSc7dBIkzRjkl%2FEfJ5wNVD1VrSaUrTe%2BoTLSzD79BPTzuNusiTh4ncs%3D&pass_ticket=KdWwp6HEWmoTS8GSn1oid2P0kpYp8zfkfj61vz4RQjELMQ8AX2TYGm2QNSCz4Ilt&wx_header=3
* openEuler Nanjing User Group Meetup, O&M session: bringing together users from the Nanjing area
  https://mp.weixin.qq.com/s?__biz=MzI2NDE4OTE2Mg==&mid=2247504511&idx=1&sn=eea093caf9dee9c9eb1966c5b7beb552&chksm=eab2f7faddc57eec150b9a219d9ec0eaa4dd184693eedf0c4b061df544aac4cecd0615aa7b89&sessionid=1696676381&scene=126&subscene=0&clicktime=1696736411&enterid=1696736411&ascene=3&fasttmpl_type=0&fasttmpl_fullversion=6875228-zh_CN-zip&fasttmpl_flag=0&realreporttime=1696736411715&devicetype=android-31&version=28002a3b&nettype=3gnet&abtest_cookie=AAACAA%3D%3D&lang=zh_CN&session_us=gh_db421e50e6cd&countrycode=CN&exportkey=n_ChQIAhIQ5JM3bAy0%2FmikcGHjX3h7uhLiAQIE97dBBAEAAAAAAB29B8%2FaWNoAAAAOpnltbLcz9gKNyK89dVj0Ifl7PZl7l2uLZIFkrUGZrEEbdO2R1KpbuaDF2PEn1%2BtKGo5s6BBX5MVYxBXYoqPwPZoyNOeGcKGRnu6%2FwCk0Z%2B9bWQkV5AhxwGt0xtlKNWUVtvUIWkiJwg2gehFl8PIzKvsAxOpAcFP7m4uu%2FciYrgEyT%2B9Ku3zNejNy307RgVJmtNLX0W6i%2FPxkh33f99xfSkFxh%2BLMTwahWYi6j9DOkdSjP0N2cPiI6TlW5V6dTBc9vtwLDZ%2FAVHB8czo%3D&pass_ticket=ZVre9KraEO939RK%2FQiZuhbA5K%2FoRdpjK1M1T94%2FHS5YqG2DeEma1qfBoAqbUG8um&wx_header=3
* SysCare: safeguarding your operating system
  https://mp.weixin.qq.com/s?__biz=MzI2NDE4OTE2Mg==&mid=2247503429&idx=2&sn=bddedbe5a9ea8d4177eaa54b5e606e30&chksm=eab2ebc0ddc562d68eeec9d9ffb7aa55f92eb3bc1217e9f684c0752a9e4d442ac73819ba59ee&sessionid=1696676381&scene=126&subscene=0&clicktime=1696736454&enterid=1696736454&ascene=3&fasttmpl_type=0&fasttmpl_fullversion=6875228-zh_CN-zip&fasttmpl_flag=0&realreporttime=1696736454367&devicetype=android-31&version=28002a3b&nettype=3gnet&abtest_cookie=AAACAA%3D%3D&lang=zh_CN&session_us=gh_db421e50e6cd&countrycode=CN&exportkey=n_ChQIAhIQEr5WzRi9phg8gRx6bM2J8hLiAQIE97dBBAEAAAAAAGLuIFChVYkAAAAOpnltbLcz9gKNyK89dVj0KCpT5KcIwRgOWuuTbC7x4CmRR02szRUWWw2bGHKHDv7UfD%2B8gpLAs3okO32K%2FXqI9Mu85H4OPUmYq3SdTULY4L6em7rHttrk7MnPW0o6X8n9LBbxgnSDpIfvpe7XdCweyCivCW97Hh1hHvrRD1OivtuSBqlAJezT0vbDiBuDvDPLGtbDazp%2FdU3JUMbqHQ0FeiQBaqpEwdjDxP0h7MZPfXG0znflsc%2FW%2B7fCY8I33ECYlfEzwpswePH%2F2Y4%3D&pass_ticket=MB5OEpdbPKMrkjKmzuAHCoKaJRuIm5PJiefH9RPtKAGVsL8mP2KUqzYY0wUcZADt&wx_header=3
* A-Ops case study: online application performance diagnosis in a database scenario
  https://mp.weixin.qq.com/s?__biz=MzI2NDE4OTE2Mg==&mid=2247502018&idx=1&sn=38067985fe51e9d0cc8d1689f4f13e6c&chksm=eab2ed47ddc56451a9a5e9e52db3fcd07464fb41ce4ddbebde919b400b95e864f561dbf52c34&sessionid=1696676381&scene=126&subscene=0&clicktime=1696736505&enterid=1696736505&ascene=3&fasttmpl_type=0&fasttmpl_fullversion=6875228-zh_CN-zip&fasttmpl_flag=0&realreporttime=1696736505208&devicetype=android-31&version=28002a3b&nettype=3gnet&abtest_cookie=AAACAA%3D%3D&lang=zh_CN&session_us=gh_db421e50e6cd&countrycode=CN&exportkey=n_ChQIAhIQrl4A9qYciLNe9Wnyh4L7ZhLiAQIE97dBBAEAAAAAAI%2FOMBc9CqQAAAAOpnltbLcz9gKNyK89dVj0VP8v4U73H8F0MDgW8pPnbxbQ6CNHg3EDJTPhcdZCxo76zmgpcyJ0vCtV88Ub0ADjMxh1ZFQUAI1oeR%2B%2FhFc9pi3Imy9FdYzZotSXa%2FBfKfepkKDkjhs1eQf52mHfMwQ%2BTysolp7w9Nro6%2BZcss0Hu%2BMgkTSyJtQDgw6u%2FPvsPGz711ukRRjJjq2%2FYW%2Bd%2B79kPIKSfuxebxEyxASbeITbvQew%2FQNAzN2uPJLZZ6Y89q7OY9xemrm7gjWfOZI%3D&pass_ticket=Vz4m5CbcOckYRZd1unkRpM%2FBimVwlcOHYcr%2FiZVzD0SwCrckvsRFd2SKNDKeZDJK&wx_header=3
* openEuler community AI-OPS Meetup successfully held
  https://mp.weixin.qq.com/s?search_click_id=1347844628199799088-1696736778807-6505567941&__biz=MzI2NDE4OTE2Mg==&mid=2247487636&idx=2&sn=ad85203a4673e58675fc5f3bf853b100&chksm=eab13511ddc6bc073456464012c7ccdad44b61c9ea99c0112d6f7141936f27bd2a4dd532df7d&scene=7&subscene=10000&sessionid=1696676381&clicktime=1696736778&enterid=1696736778&ascene=65&fasttmpl_type=0&fasttmpl_fullversion=6875228-zh_CN-zip&fasttmpl_flag=0&realreporttime=1696736778831&devicetype=android-31&version=28002a3b&nettype=3gnet&abtest_cookie=AAACAA%3D%3D&lang=zh_CN&countrycode=CN&exportkey=n_ChQIAhIQ9DzxyFaZPOjaKs0CtJCWJxLTAQIE97dBBAEAAAAAADMVNvjCBtMAAAAOpnltbLcz9gKNyK89dVj0ZDoNcba3LdMCM%2FBKjPIsEQY%2F2VUkrvqk3tybJ%2Fm0oMSW3Te8MMqcdK5gHHD3CgbvMcFxE8nNsDzv21OoH8iUYIKVRO2KLcCXHAP2fclTgD4NbIQSsBX2rmFVWYG17R3TkoH1uaRaenMs5nDEyVPsyOL%2BtS%2BI1YF97K1TUDi9XumTFETf%2F0P0kt%2FKQ5Uxpx4H0o05GDRxD1bvygi8nNrC8TN0voByHz8HIdi0stY%3D&pass_ticket=%2FJifusoyrHSGCw03vZsUiO0bvJOlwi8gSgCARbHYmNXiR9K862JDEmR77lvJCcau&wx_header=3
* openEuler community establishes the OPS SIG to build never-interrupted infrastructure for openEuler
  https://mp.weixin.qq.com/s?search_click_id=1347844628199799088-1696736816720-3144230748&__biz=MzI2NDE4OTE2Mg==&mid=2247487113&idx=1&sn=9c8a315d292a8ffb0221dbab5a9707cb&chksm=eab12b0cddc6a21a9ba756275893370f4f90678a03eb6d8a63e2cea0f742af2073ca6dcf6d0d&scene=7&subscene=10000&sessionid=1696676381&clicktime=1696736816&enterid=1696736816&ascene=65&fasttmpl_type=0&fasttmpl_fullversion=6875228-zh_CN-zip&fasttmpl_flag=0&realreporttime=1696736816747#rd