Find.find, limited recursion

Brian_Wallace · 15 September 2009 21:01

Hi all,

I'm working on a script that currently uses Find.find to process a
complete directory tree of files and directories. However, I need more
specific behavior, and I'm still fairly new to this.

Basically it works like this: the user specifies the root directory of a
collection of sub-directories we are interested in. For instance:

User specifies: C:\Root

C:\Root
          >
          -->\Dir1
                   >
                   -->Files_in_dir1
          >
          -->\Dir2
                   >
                   -->Files_in_dir2

But we are only interested in the _files_ in \Dir1 and \Dir2, and
essentially want to prune recursion into C:\Root\Dir1\Subdir_of_Dir1 and
C:\Root\Dir2\Subdir_of_Dir2 if those exist.

This is what I am currently using. However, it picks up the sub-directories
of Dir1 and Dir2 and updates the hash. Does anyone see a way this could be
refactored so that the "process_list" hash is not updated with the
sub-directories of Dir1 and Dir2?

--

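require 'find'

# Single quotes keep the backslash; in double quotes "\R" collapses to "R".
@root_dir = 'C:\Root'

process_list = Hash.new { |h, k| h[k] = [] }

Find.find(@root_dir.to_s) do |f|
  f = f.gsub(/\\/, '/')
  # skip directories and unwanted file names
  next if File.stat(f).directory? or f.include?('skipped_filename')
  Find.prune if f.include?("skipped_dirname")
  d, b = File.split(f)
  process_list[d] << b
end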

Thanks in advance!

Brian

Daniel_Berger2 · 16 September 2009 01:48

Perhaps you would like file-find better. I think this is what you want:

gem install file-find

require 'file/find'

rule = File::Find.new(
  :path => ['C:/Root/Dir1', 'C:/Root/Dir2'],
  :maxdepth => 2,
  :directory? => false
)

p rule.find

http://shards.rubyforge.org/wiki/wiki.pl?File-Find

Regards,

Dan

Robert_K1 · 16 September 2009 06:51

require 'no gems' # silly joke

Dir["#{@root_dir}/*/*"].each do |f|
  next unless File.file? f
  d, b = File.split f
  process_list[d] << b
end
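
The glob approach above can be tried in isolation with a throwaway tree. This is only a sketch: the helper name `files_one_level_down` and the temporary directory layout are made up for illustration, and the backslash normalisation is an assumption for Windows-style roots such as C:\Root, since Dir.glob treats a backslash as an escape character.

```ruby
require 'tmpdir'
require 'fileutils'

# Hypothetical helper: group the plain files that sit exactly one
# directory level below +root+, keyed by their directory.
def files_one_level_down(root)
  list = Hash.new { |h, k| h[k] = [] }
  # Normalise Windows separators, because glob patterns treat "\" as an escape.
  pattern = File.join(root.tr('\\', '/'), '*', '*')
  Dir.glob(pattern).sort.each do |path|
    next unless File.file?(path) # drops Subdir_of_Dir1 and similar directories
    dir, base = File.split(path)
    list[dir] << base
  end
  list
end

# Build a small tree that mirrors the layout in the question.
Dir.mktmpdir do |root|
  FileUtils.mkdir_p(File.join(root, 'Dir1', 'Subdir_of_Dir1'))
  FileUtils.mkdir_p(File.join(root, 'Dir2'))
  FileUtils.touch(File.join(root, 'Dir1', 'a.txt'))
  FileUtils.touch(File.join(root, 'Dir1', 'Subdir_of_Dir1', 'deep.txt'))
  FileUtils.touch(File.join(root, 'Dir2', 'b.txt'))
  FileUtils.touch(File.join(root, 'top.txt'))

  found = files_one_level_down(root)
  p found.map { |dir, names| [File.basename(dir), names] }.sort
  # prints [["Dir1", ["a.txt"]], ["Dir2", ["b.txt"]]]
end
```

The pattern root/*/* never reaches deep.txt, and top.txt is one level too shallow to match, so no explicit pruning is needed.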

Kind regards

robert

--
remember.guy do |as, often| as.you_can - without end
http://blog.rubybestpractices.com/

Bertram_Scharpf · 16 September 2009 17:09

Hi,

Of course I recommend my RbFind tool. It is pure Ruby, no C to
compile.

<http://raa.ruby-lang.org/project/rbfind>

Do something like (untested):

  process_list = Hash.new { |h,k| h[k] = [] }

  RbFind.run root do
    prune if dir?
    next if name =~ /skipped/
    process_list[ dirname] << name
  end
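
For comparison, the same prune-on-directory idea can be sketched with only the standard library's Find module. In the loop from the question, `next` fires for every directory before `Find.prune` is reached, so nothing is ever pruned. The version below decides by depth instead. The helper name `collect_one_level` and the slash-counting depth calculation are assumptions for illustration, and it presumes the root is not the filesystem root itself.

```ruby
require 'find'
require 'tmpdir'
require 'fileutils'

# Hypothetical helper: collect files exactly one directory below +root+
# and prune every directory deeper than that.
def collect_one_level(root)
  root = File.expand_path(root) # also yields forward slashes on Windows
  list = Hash.new { |h, k| h[k] = [] }
  Find.find(root) do |path|
    # depth 0 = root, 1 = Dir1/Dir2, 2 = entries inside Dir1/Dir2
    depth = path.count('/') - root.count('/')
    if File.directory?(path)
      Find.prune if depth >= 2 # do not descend into Subdir_of_Dir1
      next
    end
    next unless depth == 2 # ignore files lying directly in root
    dir, base = File.split(path)
    list[dir] << base
  end
  list
end

Dir.mktmpdir do |root|
  FileUtils.mkdir_p(File.join(root, 'Dir1', 'Subdir_of_Dir1'))
  FileUtils.mkdir_p(File.join(root, 'Dir2'))
  FileUtils.touch(File.join(root, 'Dir1', 'a.txt'))
  FileUtils.touch(File.join(root, 'Dir1', 'Subdir_of_Dir1', 'deep.txt'))
  FileUtils.touch(File.join(root, 'Dir2', 'b.txt'))

  found = collect_one_level(root)
  p found.map { |dir, names| [File.basename(dir), names] }.sort
  # prints [["Dir1", ["a.txt"]], ["Dir2", ["b.txt"]]]
end
```

Because the directory test comes first and pruning happens there, the traversal stops at Subdir_of_Dir1 rather than walking it and filtering afterwards.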

Bertram

--
Bertram Scharpf
Stuttgart, Deutschland/Germany
http://www.bertram-scharpf.de

Brian_Wallace · 16 September 2009 13:43

Thanks Dan,

That would actually work out OK. However, since I'm using JRuby, I wouldn't
be able to use a gem with native C extensions.

Thanks for the suggestion!

Brian

Brian_Wallace · 16 September 2009 13:51

Robert,

Hrrm, a simple solution of course... and I've spent all this time trying
to figure it out on my own, without asking for help!

You've helped me in the past with a few other problems, and you were spot on
each time.

I envy you, and hope that I am able to become even half as knowledgeable
as you are.

Best Regards,

Brian

Daniel_Berger2 · 16 September 2009 15:41

There's a JRuby-specific gem for file-find. A simple "gem install file-find"
should work, but if it doesn't, try "gem install file-find --platform=java"
and see if that works.

Regards,

Dan

Robert_K1 · 16 September 2009 19:55

> Hrrm - A simple solution of course ... and I've spent all this time trying
> to figure it out on my own , without asking for help!
>
> You've helped me in the past with a few other problems, and you were spot on
> each time ..

Great to hear that I could provide helpful insights.

> I envy you, and hope that I am able to become even as half as knowledgeable
> as you are..

You sure will - you practically cannot avoid it: experience takes time - and it automatically comes with time.

Btw, it's the mistakes we learn from.

Kind regards

robert

