Programming Computer Vision with Python （学习笔记五）（learn python programming）

25-04-25 1

对于ProgrammingComputerVisionwithPython感兴趣的读者，本文将提供您所需要的所有信息，我们将详细讲解学习笔记五，并且为您提供关于3DComputerVision、and

对于Programming Computer Vision with Python 感兴趣的读者，本文将提供您所需要的所有信息，我们将详细讲解学习笔记五，并且为您提供关于3D Computer Vision、android studio问题-E / Vision：加载可选模块com.google.android.gms.vision.ocr时出错、awesome computer vision repo、Azure Computer Vision 之 Smart Crop 智能裁剪图片的宝贵知识。

本文目录一览：

Programming Computer Vision with Python （学习笔记五）（learn python programming）
3D Computer Vision
android studio问题-E / Vision：加载可选模块com.google.android.gms.vision.ocr时出错
awesome computer vision repo
Azure Computer Vision 之 Smart Crop 智能裁剪图片

Programming Computer Vision with Python （学习笔记五）（learn python programming）

SciPy库

SciPy库，与之前我们使用的NumPy和Matplotlib，都是scipy.org提供的用于科学计算方面的核心库。相对NumPy，SciPy库提供了面向更高层应用的算法和函数(其实也是基于NumPy实现的），并以子模块的形式组织，每个子模块对应不同的应用领域，下面列出我们关心的一部分进行说明：
constants：物理和数学常量
fftpack：快速傅里叶（立）变换
integrate: 积分和方程
interpolate: 插值
linalg: 线性代数
ndimage: N维图像处理
optimize: 优化及根求解

安装

sudo apt-get install python-scipy

图像模糊数学原理

以灰度图像为例，把图像的每个像素的灰度值，变换为它周围邻近的N个像素值的平均值，得出的图像就有了模糊效果，但这种效果不理想，体现不出边缘（不够自然吧），所以提出了一种比较理想的方法，就是使用加权平均值，因为对某个像素而言，离它越近的像素，与它的关联性越高，所以权值应该越大，相反，离它越远的像素，与它的关联性越低，权值应该越小。

那分配权值应该使用什么样的算法呢？最常用的就是高斯分布函数。

高斯分布函数其实应该是高斯分布的概率密度函数，简称高斯分布函数或正态分布函数，它的二维空间的形态像一个钟，如图：
512px-Gaussian_2d.svg.png

这里我们只关心二维的高斯分布函数，因为等下我们要把它应用在二维图像的像素的权值分配上。下面是二维空间的高斯分布函数公式：
图片描述

这个公式被称作高斯核。如果我们给定sigma（如0.84089642）的值，以及高斯分布的（范围）大小，就可以得出一个矩阵如：
图片描述
注意中心元素 (4,4)处有最大值，随着距离中心越远，数值一圈圈地对称减小。这样，矩阵的每个元素相对中心点的权值就已经知道了，如果我们把这个矩阵的中心对应到图像的每个点，就可以知道中心点对应的像素应该如何通过周围的点来计算。

如果矩阵的中心对应到原图像边沿的像素点，那么可能有一部分矩阵的元素落到图像之外，为保证能正常运算，一种简单的处理就是把图像一边的数据复制到缺少的一边去。

对原图像的每个像素点，运用上面的权值矩阵，算出加权平均值，作为新图像的像素值，这个运算可以用原图像与矩阵的卷积来表示，而参与卷积的这个矩阵，叫模板，这种卷积运算，叫模板卷积。模板也称为算子。上面示例的这个矩阵，其实就是高斯平均算子。

高斯模糊

使用高斯平均算子来实现的图像模糊叫高斯模糊(Gaussian blurring，也叫高斯平滑)
被认为是一种最优的图像平滑处理。除了模糊，还可以用来磨皮（美颜）、制作景深、实现梦幻效果等。下面我们使用Python来对一幅图像进行模糊处理。

我们不必自己实现高斯模糊的计算过程，因为前面介绍的scipy的ndimage子模块中就包含了高斯滤波器，它是一个函数：

scipy.ndimage.filters.gaussian_filter(input, sigma, order=0, ...
多维高斯滤波器
input: ndarray
sigma: 高斯核标准偏差，越大越模糊
order: 默认0，使用高斯核进行计算，还可以指定不同值使用高斯一阶、二阶导数，以后再说

以下代码示例以sigma为2,5,10分别对一张图像进行模糊，我们使用的是GRB图像，有3个通道，必须对每个通道都应用滤波器：

from PIL import Image
import numpy as np
from scipy.ndimage import filters
import matplotlib.pyplot as plt

im = np.array(Image.open(''Penguins.jpg''))

index = 141  #画1行四列的图，与 1,4,1 同
plt.subplot(index)
plt.imshow(im)

for sigma in (2, 5, 10):
    im_blur = np.zeros(im.shape, dtype=np.uint8)
    for i in range(3):  #对图像的每一个通道都应用高斯滤波
        im_blur[:,:,i] = filters.gaussian_filter(im[:,:,i], sigma)

    index += 1
    plt.subplot(index)
    plt.imshow(im_blur)

plt.show()

不同的sigma模糊程度对比：
图片描述

小结

下一个笔记将记录有关图像边缘检测的实现，还会继续运用模板卷积运算，将涉及到图像导数，以及用于边缘检测的常用算子的介绍。

参考资料
维基百科：高斯模糊
高斯模糊浅析
高斯模糊的算法
[计算机视觉特征提取与图像处理（第三版）]

3D Computer Vision

3D Computer Vision
Programming Assignment 2 – Epipolar Geometry
You will upload your codes and a short report (PDF) in a zip file to the NewE3 system. Grading will be done
at demo time (face-to-face or Skype).
A C++ Visual Studio project is provided. To build the code, install VS 2019 (Community). When open the
solution file (project2.sln), be sure NOT to upgrade the Windows SDK Version nor the Platform Toolset:
The project should be buildable and runnable on a Windows system. Your tasks are:

[2p] For the test stereo images (pictures/stereo1_left.png , stereo1_right.png), find 8 matching pairs of
2D points. List them as g_matching_left and g_matching_right. Note: x and y are in [-1,1] range. You
can define the matching manually or
[Bonus: +1~2p to mid-term] use off-the-shelf matching methods (such as OpenGL feature matching or
others). The bonus amount depends on how well you understood and explains your matching method.
[5p] Implement the normalized eight-point method in EpipolarGeometry() to calculate the fundamental
matrix (same as essential matrix). Remember to fill your result in g_epipolar_E To verify your result, the
eight “*multiply:” stdout should output values very close to zero (around e-6 ~ e-7). The rendering
should look like:
(Here the 8 matching are the 8 vertices of the “cube”. But your matching can be anything.)
[1p] Explain what line 382-389 do? What does the “multiply” result means? Why should all the multiply
values be (close to) zero?
[3p] Download the OpenCV sfm module source code at https://github.com/opencv/ope... Go
to \modules\sfm\src\libmv_light\libmv\multiview. Explain the following functions:
FundamentalFromEssential () in fundamental.cc [1p].
MotionFromEssential() in fundamental.cc [1p].
P_From_KRt () in projection.cc [1p].
Note: “HZ” means the textbook “Multiple View Geometry in Computer Vision” by Richard Hartley and
Andrew Zisserman and a pdf is provided for your reference.
WX：codehelp

android studio问题-E / Vision：加载可选模块com.google.android.gms.vision.ocr时出错

如何解决android studio问题-E / Vision：加载可选模块com.google.android.gms.vision.ocr时出错？

我在使用Google视觉OCR库的android studio中遇到问题。这是错误：

W/DynamiteModule: Local module descriptor class for com.google.android.gms.vision.dynamite.ocr not found.
I/DynamiteModule: Considering local module com.google.android.gms.vision.dynamite.ocr:0 and remote module com.google.android.gms.vision.dynamite.ocr:0
W/DynamiteModule: Local module descriptor class for com.google.android.gms.vision.ocr not found.
I/DynamiteModule: Considering local module com.google.android.gms.vision.ocr:0 and remote module com.google.android.gms.vision.ocr:0
E/Vision: Error loading optional module com.google.android.gms.vision.ocr: com.google.android.gms.dynamite.DynamiteModule$LoadingException: No acceptable module found. Local version is 0 and remote version is 0.

你能帮我吗？

解决方法

暂无找到可以解决该程序问题的有效方法，小编努力寻找整理中！

如果你已经找到好的解决方法，欢迎将解决方案带上本链接一起发送给小编。

小编邮箱:dio#foxmail.com (将#修改为@）

awesome computer vision repo

https://blog.csdn.net/guoyunfei20/article/details/88530159

# AwesomeComputerVision

**Multi-Object-Tracking-Paper-List**
https://github.com/SpyderXu/multi-object-tracking-paper-list

**awesome-object-detection**
https://github.com/hoya012/deep_learning_object_detection

**awesome-image-classification**
https://github.com/weiaicunzai/awesome-image-classification

**Visual-Tracking-Paper-List**
https://github.com/foolwood/benchmark_results

**awesome-semantic-segmentation**
https://github.com/mrgloom/awesome-semantic-segmentation

**awesome-human-pose-estimation**
https://github.com/cbsudux/awesome-human-pose-estimation

**awesome-Face-Recognition**

————————————————
版权声明：本文为CSDN博主「guoyunfei20」的原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接及本声明。
原文链接：https://blog.csdn.net/guoyunfei20/article/details/88530159

Azure Computer Vision 之 Smart Crop 智能裁剪图片

前言

一个网站通常有许多地方会用到同一张图，但是比例又不一样.

一般的做法就是用 CSS 的 cover 和 contain 来处理.

由于 cover 只会保留中间信息, 所以很多时候需要人工裁剪.

于是就有了智能裁剪的需求了.

Azure Computer Vision

参考:

官网示范

价格

价格还可以

Azure SDK for .NET

Sample code

实现步骤

1. 到 Azure portal 创建 Computer Vision

没有什么特别的, 默认就可以了 (注: 一个 account 只能有一个 free 的 Computer Vision 哦)

2. 进入 Computer Vision Resource > Keys and Endpoint 把 key 和 endpoint 抄起来

3. 安装 SDK

dotnet add package Microsoft.Azure.CognitiveServices.Vision.ComputerVision

2 个核心功能, 第 1 个是获取全图焦点, 第 2 个是给定要求智能裁剪

[HttpPost("SmartCrop")]
public async Task<ActionResult> SmartCropAsync()
{
    var subscriptionKey = "key";
    var endpoint = "https://jbreviews-cv.cognitiveservices.azure.com/";
    var client = new ComputerVisionClient(new ApiKeyServiceClientCredentials(subscriptionKey))
    {
        Endpoint = endpoint
    };
    var imageFileFullPath = @"WebApi\\Controller\\Test\\SmartCrop\\10.png";
    using var imagestream = new FileStream(imageFileFullPath, FileMode.Open);
    // get area of interest
    var areaOfInterestResult = await client.GetAreaOfInterestInStreamAsync(imagestream); // 这里返回之后 imagestream 就自动被 close 了
    using var image = Image.Load(imageFileFullPath);
    var croppedImage = image.Clone(imageProcessing =>
    {
        imageProcessing.Crop(new Rectangle(
            x: areaOfInterestResult.AreaOfInterest.X,
            y: areaOfInterestResult.AreaOfInterest.Y,
            width: areaOfInterestResult.AreaOfInterest.W,
            height: areaOfInterestResult.AreaOfInterest.H)
        );
    });
    croppedImage.SaveAsJpeg(
        @"WebApi\\Controller\\Test\\SmartCrop\\11.png", new SixLabors.ImageSharp.Formats.Jpeg.JpegEncoder
        {
            Quality = 85
        }
    );
    
    // get smart crop image
    using var imagestream2 = new FileStream(imageFileFullPath, FileMode.Open);
    var croppedimagestream = await client.GenerateThumbnailInStreamAsync(300, 100, imagestream2, smartCropping: true);
    using var imageFileStream = System.IO.File.Create(@"WebApi\\Controller\\Test\\SmartCrop\\12.png");
    croppedimagestream.copyTo(imageFileStream);
    return Ok();
}

关于Programming Computer Vision with Python 和学习笔记五的问题我们已经讲解完毕，感谢您的阅读，如果还想了解更多关于3D Computer Vision、android studio问题-E / Vision：加载可选模块com.google.android.gms.vision.ocr时出错、awesome computer vision repo、Azure Computer Vision 之 Smart Crop 智能裁剪图片等相关内容，可以在本站寻找。

本文标签：