<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"><channel><title>Tolshao</title><description>探物及理 | BUAA | 技术博客</description><link>https://blog.tolshao.xyz/</link><item><title>二刷 NASA 电机设计备忘录：“多物理场耦合”才是通关秘籍</title><link>https://blog.tolshao.xyz/posts/nasa_motor/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/nasa_motor/</guid><description>二刷 NASA 电机设计备忘录：“多物理场耦合”才是通关秘籍</description><pubDate>Fri, 23 Jan 2026 07:05:44 GMT</pubDate></item><item><title>FOC 就是把交流当直流使？老列带你捅破这层窗户纸</title><link>https://blog.tolshao.xyz/posts/foc1/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/foc1/</guid><description>FOC 就是把交流当直流使？老列带你捅破这层窗户纸</description><pubDate>Wed, 21 Jan 2026 03:21:41 GMT</pubDate></item><item><title>ios黄页支持订阅啦，外卖、快递、推销一眼便知！</title><link>https://blog.tolshao.xyz/posts/ios-yellow-page-update/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/ios-yellow-page-update/</guid><description>还在手动导入 vCard？快来试试 CardDAV 订阅吧，一劳永逸，自动更新！</description><pubDate>Thu, 24 Jul 2025 02:00:00 GMT</pubDate></item><item><title>Python-Latex主题分享</title><link>https://blog.tolshao.xyz/posts/python%E4%B8%BB%E9%A2%98%E5%88%86%E4%BA%AB/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/python%E4%B8%BB%E9%A2%98%E5%88%86%E4%BA%AB/</guid><description>&apos;人生苦短，我用python,Life is short, you need Python——Bruce Eckel&apos;</description><pubDate>Fri, 24 Sep 2021 07:00:07 GMT</pubDate></item><item><title>mac开启HiDPI</title><link>https://blog.tolshao.xyz/posts/mac%E5%BC%80%E5%90%AFhidpi/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/mac%E5%BC%80%E5%90%AFhidpi/</guid><description>让你的显示器支持苹果的HiDPI黑科技，用降低分辨率的代价获得更好的显示效果「手动狗头」，对你的眼睛好一点</description><pubDate>Wed, 06 Jan 2021 18:41:19 GMT</pubDate></item><item><title>Getting Started with gym</title><link>https://blog.tolshao.xyz/posts/rl_gym_start/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_gym_start/</guid><description>OpenAI gym 的入门教程，参考自官网的gym手册</description><pubDate>Tue, 08 Sep 2020 07:36:04 GMT</pubDate></item><item><title>强化学习：控制工程师帮你醍醐灌顶</title><link>https://blog.tolshao.xyz/posts/rl_matlab_youtube/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_matlab_youtube/</guid><description>youtube上官方matlab下深度学习RL课程笔记，从工程师的角度宏观上概述了RL问题的所有关键点和注意点</description><pubDate>Tue, 08 Sep 2020 06:55:39 GMT</pubDate></item><item><title>RL实践3——为Agent添加Policy、记忆功能</title><link>https://blog.tolshao.xyz/posts/rl_pr3/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_pr3/</guid><description>利用SARSA（0）的学习方法，帮助agent学习到价值函数(表），进而选取动作。</description><pubDate>Thu, 03 Sep 2020 03:41:39 GMT</pubDate></item><item><title>RL实践2——RL环境gym搭建</title><link>https://blog.tolshao.xyz/posts/rl_pr2_gym/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_pr2_gym/</guid><description>算法研究者，可以快速利用多种不同的环境验证迭代自己的算法有效性。算法应用，可以效仿gym中的接口，搭建自己的环境。</description><pubDate>Thu, 03 Sep 2020 02:11:11 GMT</pubDate></item><item><title>RL实践1——动态规划值迭代</title><link>https://blog.tolshao.xyz/posts/rl_pr_1_dp/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_pr_1_dp/</guid><description>实现用 动态规划 值迭代 的方法，求解格子世界中的随机策略价值函数</description><pubDate>Wed, 02 Sep 2020 09:27:13 GMT</pubDate></item><item><title>强化学习笔记10：经典游戏示例 classic games</title><link>https://blog.tolshao.xyz/posts/rl_10/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_10/</guid><description>介绍RL历史中的经典案例</description><pubDate>Thu, 27 Aug 2020 07:12:42 GMT</pubDate></item><item><title>强化学习笔记9：探索和利用 exploration and exploitation</title><link>https://blog.tolshao.xyz/posts/rl_9/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_9/</guid><description>利用让Agent更稳定，探索让Agent上限更高，二者不可得兼，平衡一下吧</description><pubDate>Sun, 23 Aug 2020 10:35:12 GMT</pubDate></item><item><title>解锁播放器的隐藏功能👀用过的都说好😎</title><link>https://blog.tolshao.xyz/posts/iina_potplayer/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/iina_potplayer/</guid><description>教你用浏览器看电视，全球的频道都可以access，跳广告的什么的也都在这儿了</description><pubDate>Fri, 21 Aug 2020 00:49:03 GMT</pubDate></item><item><title>免费图床搭建:Github+Picgo+jsDelivr</title><link>https://blog.tolshao.xyz/posts/img_bed/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/img_bed/</guid><description>免费图床，稳定可靠，结合CDN加速，棒了</description><pubDate>Wed, 19 Aug 2020 10:35:50 GMT</pubDate></item><item><title>强化学习笔记8：整合学习和规划</title><link>https://blog.tolshao.xyz/posts/rl_8/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_8/</guid><description>规划是基于模型的搜索，学习是基于数据的总结，二者结合，1+1&gt;2</description><pubDate>Mon, 17 Aug 2020 08:05:59 GMT</pubDate></item><item><title>hexo 进阶设置指南（持续更新）</title><link>https://blog.tolshao.xyz/posts/hexo_advanced/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/hexo_advanced/</guid><description>一点点装饰你的房子，让它变得更漂亮</description><pubDate>Wed, 12 Aug 2020 15:19:44 GMT</pubDate></item><item><title>强化学习笔记7：策略梯度 Policy Gradient</title><link>https://blog.tolshao.xyz/posts/rl_7/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_7/</guid><description>策略梯度法，可以实现不基于价值函数的动作选取，在训练过程中稳定性更优</description><pubDate>Tue, 11 Aug 2020 07:09:31 GMT</pubDate></item><item><title>从0 -&gt; 1，拥有你的免费个人博客之“打个前站”</title><link>https://blog.tolshao.xyz/posts/blog_setup/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/blog_setup/</guid><description>不买域名，不租服务器，不写html，跟我走</description><pubDate>Fri, 07 Aug 2020 16:00:00 GMT</pubDate></item><item><title>ios黄页：可算让iPhone好用了点儿</title><link>https://blog.tolshao.xyz/posts/ios_yellow_page/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/ios_yellow_page/</guid><description>ios黄页，让你用iPhone打电话的时候快人一步</description><pubDate>Fri, 07 Aug 2020 07:09:31 GMT</pubDate></item><item><title>为什么数值仿真里要用RK4（龙格库塔法）</title><link>https://blog.tolshao.xyz/posts/rk4/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rk4/</guid><description>当然是因为他仿真精度高啊，为啥，进来看看吧小跳最近在搭建一个数值仿真环境，由于需要用到python里面的一些库，所以不得不把simulink的模型搬过来，我们都知道在simulink里，仿真的时候设置仿真步长和微分方程求解器是必要的步骤。但是为什么要设置这个小跳却早已忘记了。</description><pubDate>Wed, 05 Aug 2020 07:09:31 GMT</pubDate></item><item><title>强化学习笔记6：值函数估计Value function Approximation</title><link>https://blog.tolshao.xyz/posts/rl_6/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_6/</guid><description>离散空间的RL问题可以构建value table进行查表解决，对于连续空间的问题，可以引入值函数估计器，解决了查表运算量大的问题</description><pubDate>Wed, 05 Aug 2020 07:09:31 GMT</pubDate></item><item><title>深度学习22张精炼图笔记总结</title><link>https://blog.tolshao.xyz/posts/deep_learning_summary/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/deep_learning_summary/</guid><description>记录了深度学习课程的知识与亮点，不仅仅适合初学者了解深度学习，还适合机器学习从业者和研究者复习基本概念。</description><pubDate>Mon, 03 Aug 2020 07:09:31 GMT</pubDate></item><item><title>Keras &amp; Tensorflow 笔记</title><link>https://blog.tolshao.xyz/posts/keras_tensorflow/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/keras_tensorflow/</guid><description>Keras是一个高层神经网络API，能够把你的idea迅速转换为结果</description><pubDate>Mon, 03 Aug 2020 07:09:31 GMT</pubDate></item><item><title>强化学习笔记5：无模型控制 Model-free control</title><link>https://blog.tolshao.xyz/posts/rl_5/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_5/</guid><description>完成了不基于模型的策略评估之后，可以采取$psilon$-greedy等方法进行动作选取，根据状态信息进行动作选取并执行，就实现了不基于模型的控制</description><pubDate>Sat, 01 Aug 2020 07:09:31 GMT</pubDate></item><item><title>深度学习-Coursera笔记</title><link>https://blog.tolshao.xyz/posts/deep_learning/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/deep_learning/</guid><description>深度学习是用深度神经网络的方法，将机器学习加以拓展，其优势是可以实现超复杂非线性函数的映射</description><pubDate>Thu, 23 Jul 2020 07:09:31 GMT</pubDate></item><item><title>控制理论笔记-2</title><link>https://blog.tolshao.xyz/posts/advance_control_2/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/advance_control_2/</guid><description>BiliBili_Dr_can 课程笔记</description><pubDate>Wed, 15 Jul 2020 10:09:31 GMT</pubDate></item><item><title>卷积神经网络CNN（convolutional）</title><link>https://blog.tolshao.xyz/posts/cnn/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/cnn/</guid><description>卷积神经网络，一般用于Computer vision等领域，典型应用有物体检测、人脸识别等</description><pubDate>Wed, 15 Jul 2020 07:09:31 GMT</pubDate></item><item><title>强化学习笔记4：无模型预测 model-free prediction</title><link>https://blog.tolshao.xyz/posts/rl_4/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_4/</guid><description>RL的一个重要突破就是不基于模型的控制，在控制之前，需要先用model-free control对策略进行评估</description><pubDate>Wed, 15 Jul 2020 07:09:31 GMT</pubDate></item><item><title>强化学习笔记3：动态规划 planning by dynamic programming（DP）</title><link>https://blog.tolshao.xyz/posts/rl_3/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_3/</guid><description>规划是基于模型的搜索，学习是基于数据的总结。动态规划DP，用迭代的方法将价值函数、策略收敛到最优</description><pubDate>Fri, 10 Jul 2020 07:09:31 GMT</pubDate></item><item><title>MBSE 基于模型的系统工程</title><link>https://blog.tolshao.xyz/posts/mbse/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/mbse/</guid><description>改变传统设计的繁杂工作流程，用系统的思想、数字化的语言和工具，将项目周期加快，加快，再加快</description><pubDate>Wed, 08 Jul 2020 07:09:31 GMT</pubDate></item><item><title>强化学习笔记2：马尔科夫决策过程Markov decision process(MDP)</title><link>https://blog.tolshao.xyz/posts/rl_2/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_2/</guid><description>将游戏过程、动力学过程抽象为马尔科夫过程MP，便于引入到RL进行研究</description><pubDate>Sun, 05 Jul 2020 07:09:31 GMT</pubDate></item><item><title>强化学习笔记1：基本概念</title><link>https://blog.tolshao.xyz/posts/rl_1/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/rl_1/</guid><description>从简单概念入手，介绍强化学习Reinforcement learning的基本结构</description><pubDate>Wed, 01 Jul 2020 07:09:31 GMT</pubDate></item><item><title>机器学习-Coursera笔记</title><link>https://blog.tolshao.xyz/posts/machine_learning/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/machine_learning/</guid><description>Coursera网站Andrew Ng的ML课程笔记</description><pubDate>Sun, 28 Jun 2020 07:09:31 GMT</pubDate></item><item><title>RNN 序列模型 sequence model</title><link>https://blog.tolshao.xyz/posts/sequence_model/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/sequence_model/</guid><description>RNN序列模型，主要用于自然语言处理NLP等环境，引入attention机制，让网络的input在随时间步进行中，较远的运算之得以保留</description><pubDate>Thu, 25 Jun 2020 07:09:31 GMT</pubDate></item><item><title>科学写作</title><link>https://blog.tolshao.xyz/posts/paper_writting/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/paper_writting/</guid><description>追求效率至上，知道怎么做，比早点出发更重要</description><pubDate>Sun, 03 May 2020 07:09:31 GMT</pubDate></item><item><title>Mac必备软件推荐，让你效率起飞🚀</title><link>https://blog.tolshao.xyz/posts/mac_app/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/mac_app/</guid><description>Mac2021装机必备，让你的效率提高20倍，系统、多媒体、写作都在这儿了</description><pubDate>Thu, 30 Apr 2020 16:00:00 GMT</pubDate></item><item><title>控制理论笔记</title><link>https://blog.tolshao.xyz/posts/advanced_control/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/advanced_control/</guid><description>经典控制理论笔记，线性系统控制理论笔记</description><pubDate>Sun, 05 Apr 2020 10:09:31 GMT</pubDate></item><item><title>Mac设置</title><link>https://blog.tolshao.xyz/posts/settings_mac/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/settings_mac/</guid><description>记录备忘mac系统、软件的设置</description><pubDate>Wed, 01 Apr 2020 07:09:31 GMT</pubDate></item><item><title>Latex设置</title><link>https://blog.tolshao.xyz/posts/settings_latex/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/settings_latex/</guid><description>备忘Latex的设置（持续更新）</description><pubDate>Wed, 18 Mar 2020 07:09:31 GMT</pubDate></item><item><title>Matlab设置</title><link>https://blog.tolshao.xyz/posts/settings_matlab/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/settings_matlab/</guid><description>备忘matlab设置和snippets，自定义打印格式调整和快捷保存图片（持续更新）</description><pubDate>Sat, 01 Feb 2020 07:09:31 GMT</pubDate></item><item><title>python-snippets</title><link>https://blog.tolshao.xyz/posts/settings_python/</link><guid isPermaLink="true">https://blog.tolshao.xyz/posts/settings_python/</guid><description>备忘python的snippets，python画图，从txt、excel读取数据等（持续更新）</description><pubDate>Wed, 01 Jan 2020 07:09:31 GMT</pubDate></item></channel></rss>