经管实证
发布于

实证匹配技巧:公司名称中英文对照匹配案例

一、案例背景

- 根据已知的公司名称中英文对照表,与符合条件的公司中英文名称匹配


![](https://files.mdnice.com/user/22371/1743d48b-b0d1-4782-b00e-f018b17fd8ab.png)


![](https://files.mdnice.com/user/22371/1533d953-0e48-4d20-872b-98b5a165f066.png)


- 36分钟讲解视频


![](https://files.mdnice.com/user/22371/f90da92b-9a69-4610-b1d9-8090fb4f84c0.png)


二、部分code

```Python

import pandas as pd

import numpy as np

import warnings

warnings.filterwarnings("ignore")



f_name = pd.read_stata("f_name.dta")

p_name = pd.read_stata("p_name.dta")

t_name = pd.read_stata("tax_name.dta")


p_name = p_name.replace({'':np.nan}).dropna()


match1 = pd.merge(left=f_name, right=p_name,

                 left_on="n_f", right_on="n_p",

                 how="right")

match1.dropna()


f_name["f_ori"] = f_name["n_f"]

p_name["p_ori"] = p_name["n_p"]



f_name["n_f"] = f_name["n_f"].str.lower()

p_name["n_p"] = p_name["n_p"].str.lower()



del_str = ["co", "corporation", "corp",

          "imp", "import",

          "exp", "export",

          "trading", "trade",

          "&", "and",

          "ltd",

          "'", "\"", ".", ",", "/", "-", " "]


for i in del_str:

   f_name["n_f"] = f_name["n_f"].str.replace(i, "")

   p_name["n_p"] = p_name["n_p"].str.replace(i, "")

   

……

```

浏览 (404)
点赞 (1)
收藏
删除
评论