This article looks at whether there is a simple way to use numpy/pandas to collapse multiple rows that share the same unique identifier into a single row. It also covers "import numpy as np" ImportError: No module named numpy; 3.7 Python Data Processing: NumPy Series (7): NumPy Statistical Functions; whether there is another solution for the Anaconda NumPy error "Importing the Numpy C Extension Failed"; and Difference between import numpy and import numpy as np.
Contents:
- Is there a simple way to use numpy/pandas to collapse multiple rows with the same unique identifier into one row?
- "import numpy as np" ImportError: No module named numpy
- 3.7 Python Data Processing: NumPy Series (7): NumPy Statistical Functions
- Is there another solution for the Anaconda NumPy error "Importing the Numpy C Extension Failed"?
- Difference between import numpy and import numpy as np
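Is there a simple way to use numpy/pandas to collapse multiple rows with the same unique identifier into one row?
I have a loan DataFrame that looks like this:
[Image: loan DataFrame]
My goal is to have exactly one row per loan ID instead of several. I want separate columns for the ages of the co-borrowers and of the main borrower. I know the maximum number of co-borrowers, so I know how many columns to create.
[Image: desired DataFrame]
I wrote a script that does this, but it takes about 6 minutes to run on a DataFrame with 30K rows. Is there a faster way? Here is my code snippet: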
loan_id = []
idx = 0
col_count = 0
idx_col = 0
# first, sort the dataframe so that rows with the same loan ID are adjacent;
# reset the index so that the positional counter idx matches the row labels
co_ap = co_ap.sort_values(by=['Loan ID']).reset_index(drop=True)
for _, loan in co_ap['Loan ID'].items():
    if loan not in loan_id:
        # first row of a new loan: this is the row we keep
        loan_id.append(loan)
        col_count = 0
        idx_col = idx
    if co_ap.loc[idx, 'Borrower'] != 'Main':
        col_count += 1
        # update desired column
        co_ap.loc[idx_col, 'ST_Age_coap_' + str(col_count)] = co_ap.loc[idx, 'Age']
    else:
        co_ap.loc[idx_col, 'ST_Age_main'] = co_ap.loc[idx, 'Age']
    # if idx_col != idx, we are on a row that must eventually be dropped;
    # write a dummy value that marks it for removal later
    if idx_col != idx:
        co_ap.loc[idx, 'ST_Age_main'] = -1
    idx += 1
# drop rows not required
co_ap = co_ap[co_ap['ST_Age_main'] != -1]
Solution
I think pivot is most likely what you are looking for:
df.pivot(index='Loan ID', columns='Borrower', values='Age')
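A minimal sketch of what `pivot` does here and where it stops working. The sample data is invented for illustration; only the column names come from the question. `DataFrame.pivot` requires every (index, columns) pair to be unique, so it handles a loan with one main borrower and one co-borrower, but raises `ValueError` as soon as a loan has two rows with the same `Borrower` value.

```python
import pandas as pd

# Invented sample: loan "a" has one co-borrower, loan "b" has two.
loans = pd.DataFrame({
    "Loan ID": ["a", "a", "b", "b", "b"],
    "Borrower": ["Main", "Co", "Main", "Co", "Co"],
    "Age": [23, 24, 45, 34, 33],
})

# Works: every (Loan ID, Borrower) pair in this subset is unique.
unique_pairs = loans[loans["Loan ID"] == "a"]
wide = unique_pairs.pivot(index="Loan ID", columns="Borrower", values="Age")
print(wide)

# Fails: loan "b" has two rows labelled "Co".
try:
    loans.pivot(index="Loan ID", columns="Borrower", values="Age")
    pivot_failed = False
except ValueError as exc:
    pivot_failed = True
    print("pivot raised:", exc)
```

With repeated co-borrowers, each row therefore needs a distinct column label before reshaping.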
Alternatively, I think you can use groupby with cumcount and then apply pivot_table:
import numpy as np
import pandas as pd

df = pd.DataFrame({'Loan ID': ['a', 'a', 'a', 'b', 'b', 'c', 'c'],
                   'Borrower': ['main', 'co', 'co', 'main', 'co', 'main', 'co'],
                   'age': [23, 24, 27, 45, 34, 33, 23]})
# number the rows within each (Loan ID, Borrower) group: 0, 1, ...
df['temp'] = df.groupby([df['Loan ID'], df['Borrower']]).cumcount().astype(str)
# the main borrower needs no numeric suffix
df['temp'] = np.where(df['Borrower'] == 'main', '', df['temp'])
df_new = df.pivot_table(index='Loan ID', columns=df['Borrower'] + df['temp'], values='age', aggfunc='sum')
df_new.columns = [i + ' Age' for i in df_new.columns]
df_new = df_new.reset_index()
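The same idea can be sketched so that it produces the column names the question asks for (`ST_Age_main`, `ST_Age_coap_1`, ...). This is one possible arrangement, not the answer author's code: the helper `collapse_loans` and the sample data are hypothetical, and the sketch assumes each loan has at most one row labelled `Main`.

```python
import pandas as pd

# Invented sample data; column names follow the question.
loans = pd.DataFrame({
    "Loan ID": ["a", "a", "a", "b", "b", "c"],
    "Borrower": ["Main", "Co", "Co", "Main", "Co", "Main"],
    "Age": [23, 24, 27, 45, 34, 33],
})


def collapse_loans(df):
    """Return one row per loan with ST_Age_main and ST_Age_coap_<n> columns."""
    is_main = df["Borrower"].eq("Main")
    # Running count of co-borrowers within each loan: 1, 2, ...
    co_rank = (~is_main).astype(int).groupby(df["Loan ID"]).cumsum()
    # Target column name for every row.
    target = ("ST_Age_coap_" + co_rank.astype(str)).where(~is_main, "ST_Age_main")
    # Labels are now unique per loan, so a plain pivot is enough.
    wide = df.assign(target=target).pivot(index="Loan ID", columns="target", values="Age")
    wide.columns.name = None
    return wide.reset_index()


result = collapse_loans(loans)
print(result)
```

Loans with fewer co-borrowers than the maximum get `NaN` in the unused columns. Because the work is done by vectorized pandas operations instead of a Python-level loop over rows, it should scale better than the original script, though the actual speedup on 30K rows has not been measured here.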
"import numpy as np" ImportError: No module named numpy
Problem: numpy is not installed.
Solution:
Download and install the following package:
numpy-1.8.2-win32-superpack-python2.7
After installation, running import numpy produces:
Traceback (most recent call last):
  File "<pyshell#2>", line 1, in <module>
    import numpy
  File "C:\Python27\lib\site-packages\numpy\__init__.py", line 153, in <module>
    from . import add_newdocs
  File "C:\Python27\lib\site-packages\numpy\add_newdocs.py", line 13, in <module>
    from numpy.lib import add_newdoc
  File "C:\Python27\lib\site-packages\numpy\lib\__init__.py", line 8, in <module>
    from .type_check import *
  File "C:\Python27\lib\site-packages\numpy\lib\type_check.py", line 11, in <module>
    import numpy.core.numeric as _nx
  File "C:\Python27\lib\site-packages\numpy\core\__init__.py", line 6, in <module>
    from . import multiarray
ImportError: DLL load failed: %1 is not a valid Win32 application.
The cause: the installed Python is 64-bit, but the installed numpy is 32-bit.
Reinstall numpy using the 64-bit build: numpy-1.8.0-win64-py2.7
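Before choosing an installer, it can help to confirm whether the running interpreter is a 32-bit or 64-bit build. A minimal sketch using only the standard library (the variable name `pointer_bits` is just for illustration):

```python
import platform
import struct
import sys

# Size of a C pointer in bytes, times 8: 32 or 64 for the running interpreter.
pointer_bits = struct.calcsize("P") * 8

print("Python executable:", sys.executable)
print("Interpreter build:", f"{pointer_bits}-bit")
print("platform.architecture():", platform.architecture()[0])
```

The numpy build has to match this value; a 32-bit extension module cannot be loaded by a 64-bit interpreter.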
3.7 Python Data Processing: NumPy Series (7): NumPy Statistical Functions
Preface
This part covers NumPy's statistical functions.
(I) Function overview
Call style: np.*
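Function | Description |
---|---|
.sum(a) | Sum of the elements of array a |
.mean(a) | Arithmetic mean |
.average(a) | Weighted average (equals the mean when no weights are given) |
.std(a) | Standard deviation |
.var(a) | Variance |
.ptp(a) | Range (maximum minus minimum) |
.median(a) | Median |
.min(a) | Minimum |
.max(a) | Maximum |
.argmin(a) | Index of the minimum, given as an index into the flattened array |
.argmax(a) | Index of the maximum, given as an index into the flattened array |
.unravel_index(index, shape) | Convert a flat index into a multi-dimensional index for the given shape |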
(II) Statistical functions, part 1
Functions covered: .sum(a), .mean(a), .average(a), .std(a), .var(a)
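A short sketch of these functions on a small array. The array and the weights are made up for illustration and are not the tutorial's original example.

```python
import numpy as np

a = np.arange(12).reshape(3, 4)   # rows: [0..3], [4..7], [8..11]

total = np.sum(a)                 # sum of all elements
col_sums = np.sum(a, axis=0)      # sum down each column
mean = np.mean(a)                 # arithmetic mean of all elements
# Weighted average along axis 0: one weight per row.
weighted = np.average(a, axis=0, weights=[1, 2, 3])
std = np.std(a)                   # population standard deviation (ddof=0)
var = np.var(a)                   # population variance (ddof=0)

print(total, col_sums, mean, weighted, std, var)
```

All of these accept an `axis` argument; without it they reduce over the whole array.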
(III) Statistical functions, part 2
Functions covered: .max(a), .min(a), .ptp(a), .median(a), .argmin(a), .argmax(a), .unravel_index(index, shape)
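A short sketch of these functions, again on an invented array. It also shows how `unravel_index` turns the flat index returned by `argmin`/`argmax` back into a (row, column) position.

```python
import numpy as np

a = np.array([[4, 9, 2],
              [7, 1, 8]])

largest = np.max(a)
smallest = np.min(a)
value_range = np.ptp(a)           # max - min
middle = np.median(a)

flat_min = np.argmin(a)           # index into the flattened array
flat_max = np.argmax(a)

# Convert the flat indices into (row, column) positions for a's shape.
pos_min = tuple(int(i) for i in np.unravel_index(flat_min, a.shape))
pos_max = tuple(int(i) for i in np.unravel_index(flat_max, a.shape))

print(largest, smallest, value_range, middle, pos_min, pos_max)
```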
Author: Mark
Date: 2019/02/11 (Monday)
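Is there another solution for the Anaconda NumPy error "Importing the Numpy C Extension Failed"?
I hope someone can help here. I have been going in circles for a while. I am just trying to set up a Python script that loads some JSON data from a REST API into a cloud database. I set up a virtual environment in Anaconda (because the GCP libraries recommend it), installed the dependencies, and now I am just trying to import the libraries and send a request to the endpoint.
I used Conda (and conda-forge) to set up the environment and install the dependencies, so hopefully everything is clean. I am using VS Code with the Python extension as my editor.
Whenever I try to run the script, I get the message below. I have tried every solution others have found on Google/StackOverflow, but none of them worked. I normally use IDLE or Jupyter for scripting without any problems, but I do not have much experience with Anaconda, VS Code, or environment variables (which seem to be relevant).
Thanks in advance for your help!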
Traceback (most recent call last):
  File "C:\Conda\envs\gcp\lib\site-packages\numpy\core\__init__.py", line 22, in <module>
    from . import multiarray
  File "C:\Conda\envs\gcp\lib\site-packages\numpy\core\multiarray.py", line 12, in <module>
    from . import overrides
  File "C:\Conda\envs\gcp\lib\site-packages\numpy\core\overrides.py", line 7, in <module>
    from numpy.core._multiarray_umath import (
ImportError: DLL load failed while importing _multiarray_umath: The specified module could not be found.

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "c:\API\citi-bike.py", line 4, in <module>
    import numpy as np
  File "C:\Conda\envs\gcp\lib\site-packages\numpy\__init__.py", line 150, in <module>
    from . import core
  File "C:\Conda\envs\gcp\lib\site-packages\numpy\core\__init__.py", line 48, in <module>
    raise ImportError(msg)
ImportError:

IMPORTANT: PLEASE READ THIS FOR ADVICE ON HOW TO SOLVE THIS ISSUE!

Importing the numpy C-extensions failed. This error can happen for
many reasons, often due to issues with your setup or how NumPy was
installed.

We have compiled some common reasons and troubleshooting tips at:

    https://numpy.org/devdocs/user/troubleshooting-importerror.html

Please note and check the following:

  * The Python version is: python3.9 from "C:\Conda\envs\gcp\python.exe"
  * The NumPy version is: "1.21.1"

and make sure that they are the versions you expect.
Please carefully study the documentation linked above for further help.

Original error was: DLL load failed while importing _multiarray_umath: The specified module could not be found.
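The message asks you to check which interpreter and which NumPy installation are actually in use. The sketch below collects that information without importing NumPy, so it runs even while the import is broken. The helper name `describe_environment` is hypothetical. The `Library\bin` check rests on an assumption: a commonly reported cause of this DLL error on Windows is running a conda environment's python.exe without activating the environment, which leaves that directory off PATH. Nothing in the question confirms that this is the cause here.

```python
import importlib.util
import os
import sys


def describe_environment():
    """Collect facts about the running interpreter without importing NumPy."""
    # find_spec locates the package on disk; it does not execute numpy itself.
    spec = importlib.util.find_spec("numpy")
    # Assumption: in a Windows conda environment, shared DLLs live here.
    library_bin = os.path.join(sys.prefix, "Library", "bin")
    path_entries = [
        os.path.normcase(os.path.normpath(p))
        for p in os.environ.get("PATH", "").split(os.pathsep)
        if p
    ]
    return {
        "executable": sys.executable,
        "python_version": sys.version.split()[0],
        "numpy_location": spec.origin if spec else None,
        "library_bin_on_path": os.path.normcase(os.path.normpath(library_bin)) in path_entries,
    }


info = describe_environment()
for key, value in info.items():
    print(f"{key}: {value}")
```

If `executable` points into the conda environment but `library_bin_on_path` is `False`, running the script from an activated environment (for example after `conda activate gcp` in a terminal) is one thing worth trying.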
Difference between import numpy and import numpy as np
I understand that when possible one should use import numpy as np. This helps keep away any conflict due to namespaces. But I have noticed that while the command below works [...] the following does not [...]. Can someone please explain this?
Tags: python, numpy
4 Answers
Answer (score 13):
numpy is the top-level package name, and doing [...]. When you do [...]. In your above code: [...]. Here is the difference between [...].
Answer (score 7):
When you import a module via import numpy as np, the numpy package is bound to the local variable np. Thus, [...] is equivalent to [...]. When trying to understand this mechanism, it's worth remembering that [...]. When importing a submodule, you must refer to the full parent module name, since the importing mechanics happen at a higher level than the local variable scope, i.e. [...].
I also take issue with your assertion that "where possible one should [import numpy as np]". This is done for historical reasons, mostly because people get tired very quickly of prefixing every operation with numpy.
Finally, to round out my exposé, here are 2 interesting uses of the import ... as syntax:
1. long subimports
2. compatible APIs
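The binding behaviour described above can be checked directly. The following is an illustrative sketch, not the answer's original code (which did not survive in this copy); the choice of `numpy.linalg`, `xml.etree.ElementTree`, and `simplejson` as examples is mine.

```python
import importlib
import sys

import numpy as np            # binds the local name "np" to the numpy package
import numpy.linalg as la     # a submodule import must use the real package name

# "np" is only a local name; the import system knows the package as "numpy".
alias_is_package = sys.modules["numpy"] is np

# Importing through the alias fails, because no module called "np" exists.
try:
    importlib.import_module("np.linalg")
    alias_import_error = None
except ModuleNotFoundError as exc:
    alias_import_error = str(exc)

# Use 1, long subimports: shorten a deeply nested module path.
import xml.etree.ElementTree as ET

# Use 2, compatible APIs: pick whichever implementation is installed
# and expose it under a single name.
try:
    import simplejson as json
except ImportError:
    import json

print(alias_is_package, alias_import_error, la is np.linalg)
```

The `import ... as` form only changes which local name is bound; the module path in the import statement itself is always resolved against real package names.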
Answer (score 1):
When you call the statement [...]
Answer (score 1):
This is a language feature. This feature allows: [...]
Notice however that [...]. That said, when you run [...] you receive an [...]