递归搜索文件夹问题,Python交流,编程语言专区,鱼C论坛

davidmou 发表于 2023-7-30 22:00:25

递归搜索文件夹问题

import easygui as g
import os

def show_result(start_dir):
lines = 0
total = 0
text = ""

for i in source_list:
   lines = source_list
   total += lines
   text += "【%s】源文件 %d 个，源代码 %d 行\n" % (i, file_list, lines)
title = '统计结果'
msg = '您目前共累积编写了 %d 行代码，完成进度：%.2f %%\n离 10 万行代码还差 %d 行，请继续努力！' % (
   total, total/1000, 100000-total)
g.textbox(msg, title, text)

def calc_code(file_name):
lines = 0
with open(file_name) as f:
   print('正在分析文件：%s ...' % file_name)
   try:
         for each_line in f:
            lines += 1
   except UnicodeDecodeError:
         pass# 不可避免会遇到格式不兼容的文件，这里忽略掉......
return lines

def search_file(start_dir):
os.chdir(start_dir)

for each_file in os.listdir(os.curdir):
   ext = os.path.splitext(each_file)
   if ext in target:
         lines = calc_code(each_file)# 统计行数
         # 还记得异常的用法吗？如果字典中不存，抛出 KeyError，则添加字典键
         # 统计文件数
         try:
            file_list += 1
         except KeyError:
            file_list = 1
         # 统计源代码行数
         try:
            source_list += lines
         except KeyError:
            source_list = lines

   if os.path.isdir(each_file):
         os.chdir(os.path.join(os.curdir,each_file))
         search_file(os.getcwd())# 递归调用
         # 递归调用后切记返回上一层目录

target = ['.c', '.cpp', '.py', '.cc', '.java', '.pas', '.asm']
file_list = {}
source_list = {}

g.msgbox("请打开您存放所有代码的文件夹......", "统计代码量")
path = g.diropenbox("请选择您的代码库：")

search_file(path)
show_result(path)

不知道要怎么修改@_@

Mike_python小 发表于 2023-7-30 22:00:53

根据您提供的代码，我看到其中存在一些问题。以下是您可以考虑修改的地方：

1. 在show_result()函数中，循环变量应该是file_list而不是source_list。请将代码修改为：

for i in file_list:
lines = source_list
total += lines
text += "【%s】源文件 %d 个，源代码 %d 行\n" % (i, file_list, lines)

2. 在calc_code()函数中，您在使用open函数打开文件时，忘记了传递文件名参数。请将代码修改为：

with open(file_name, 'r', encoding='utf-8') as f:

并且将UnicodeDecodeError的处理方式修改为：

except UnicodeDecodeError:
pass# 不可避免会遇到格式不兼容的文件，这里忽略掉...

3. 在search_file()函数中，os.chdir()调用后，应该返回上一层目录以继续遍历其他文件。请将代码修改为：

os.chdir(os.path.pardir)# 返回上一级目录

4. 在主程序部分，您在调用show_result()函数时，传递的参数是path变量，但在show_result()函数中并未使用该参数。您可以直接将search_file(path)修改为search_file(start_dir)，然后在show_result()函数中使用start_dir。

请注意，修改代码时应注意代码缩进的正确性以及变量名的一致性。希望这些修改可以帮助您解决问题。如果还有其他疑问，请随时提问。

球一个最佳答案谢谢啦！这对我非常重要！{:10_254:}{:10_254:}{:10_254:}

琅琊王朝 发表于 2023-7-30 22:44:38

如果你想修改递归搜索文件夹的代码，可以尝试以下修改：
修改calc_code函数：将计算行数的部分改为递归方式，以支持对子文件夹内的文件进行统计。修改后的代码如下：
def calc_code(file_name):
lines = 0
with open(file_name) as f:
   print('正在分析文件：%s ...' % file_name)
   try:
         for each_line in f:
            lines += 1
   except UnicodeDecodeError:
         pass# 不可避免会遇到格式不兼容的文件，这里忽略掉......

return lines

def calc_code_recursive(file_name):
if os.path.isfile(file_name):
   return calc_code(file_name)
elif os.path.isdir(file_name):
   count = 0
   for root, dirs, files in os.walk(file_name):
         for file in files:
            count += calc_code_recursive(os.path.join(root, file))

   return count

修改search_file函数：在调用calc_code时，改为调用calc_code_recursive进行递归统计，支持对子文件夹进行搜索。同时，在递归调用后返回上一层目录。修改后的代码如下：
def search_file(start_dir):
os.chdir(start_dir)

for each_file in os.listdir(os.curdir):
   ext = os.path.splitext(each_file)
   if ext in target:
         lines = calc_code_recursive(each_file)# 统计行数
         # 统计文件数
         try:
            file_list += 1
         except KeyError:
            file_list = 1
         # 统计源代码行数
         try:
            source_list += lines
         except KeyError:
            source_list = lines

   if os.path.isdir(each_file):
         next_dir = os.path.join(os.curdir, each_file)
         os.chdir(next_dir)
         search_file(os.getcwd())# 递归调用
         os.chdir(os.path.pardir)# 返回上一层目录

通过以上修改，你可以实现在指定文件夹下递归搜索所有文件（包括子文件夹），统计各类文件的数量和源代码行数。

davidmou 发表于 2023-7-30 23:26:21

为什么search_file(os.getcwd())递归后还要os.chdir(os.path.pardir)

Mike_python小 发表于 2023-7-31 07:36:55

davidmou 发表于 2023-7-30 20:26
为什么search_file(os.getcwd())递归后还要os.chdir(os.path.pardir)

感谢您提出的问题！在search_file()函数中，为了能够递归地遍历目录树并搜索文件，我们需要使用os.chdir()函数切换到当前目录的子目录。而在子目录递归完成后，我们需要返回上一级目录以继续搜索其他文件。这就是为什么在search_file()函数中使用os.chdir(os.path.pardir)来返回上一级目录。

具体来说，os.getcwd()返回当前工作目录的路径，而os.chdir(os.path.pardir)将当前工作目录切换到其父目录（上一级目录）。这样，在递归调用search_file()函数后，程序会返回到上一级目录，然后继续搜索其他文件。

这个步骤是为了确保程序能够在整个目录树中遍历所有文件，并找到目标文件。

希望这个解释能够帮助您理解为什么需要在递归后使用os.chdir(os.path.pardir)。如果还有任何疑问，请随时提问！

页: [1]

鱼C论坛's Archiver

递归搜索文件夹问题