Can't Bio.pairwise2 align two very long sequences(50kbp and 100kbp)? #4589

Open
tanger-code opened this issue Jan 18, 2024 · 7 comments

@tanger-code

Hi.
I am aligning two long sequences (seq1 is 5wbp, seq2 is 10wbp) using Bio.pairwise2, but the process crashes with this error:
Process finished with exit code -1073741819 (0xC0000005)

Can't Bio.pairwise2 align such long sequences? Or is there another way to do this in Bio?

Any advice would help me a lot.

@MarkusPiotrowski
Contributor

Sorry for my ignorance, but what is 'w' standing for?

@tanger-code
Author

> Sorry for my ignorance, but what is 'w' standing for?

Sorry, 'w' means 10,000, so the sequences are 50,000 bp and 100,000 bp.

@MarkusPiotrowski
Contributor

Yes, that's probably too long, although I'm not sure whether the error message you see indicates a memory overflow.

However, please use the 'new' PairwiseAligner, as described here: https://biopython.org/DIST/docs/tutorial/Tutorial.html#sec127
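
As a rough illustration of the suggestion above, here is a minimal sketch of scoring two sequences with `Bio.Align.PairwiseAligner`. The helper name `global_alignment_score` and the scoring values are assumptions chosen for the example, not something taken from this thread. Computing only the score with `aligner.score` should need far less memory than building full alignments with `aligner.align`, but that is worth confirming in the tutorial for your Biopython version.

```python
def global_alignment_score(seq1, seq2):
    """Return the global alignment score of two sequences (score only)."""
    # Imported inside the function so this sketch can be loaded
    # even on a machine where Biopython is not installed.
    from Bio import Align

    aligner = Align.PairwiseAligner()
    aligner.mode = "global"
    # Illustrative scoring scheme (an assumption, not from the thread).
    aligner.match_score = 1
    aligner.mismatch_score = -1
    aligner.open_gap_score = -2
    aligner.extend_gap_score = -1
    # score() returns only the best score, without building alignments.
    return aligner.score(seq1, seq2)
```

Calling `global_alignment_score(str(record1.seq), str(record2.seq))` on the two long sequences (where `record1` and `record2` are hypothetical `SeqRecord` objects) would be a reasonable first test. If the actual alignment is needed, `aligner.align(seq1, seq2)` returns the alignments, at a much higher memory cost.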

@tanger-code
Author

Ok, I will try it.
Thank you.

@peterjc changed the title from "Can't Bio.pairwise2 align two very long sequences(5wbp and 10wbp)?" to "Can't Bio.pairwise2 align two very long sequences(50kbp and 100kbp)?" on Jan 18, 2024
@peterjc
Member

peterjc commented Jan 18, 2024

I'd have written this as 50kbp and 100kbp, with k for kilo meaning a thousand from the SI units https://en.wikipedia.org/wiki/Metric_prefix

Was the "w" for 10,000 from another language perhaps? I know Japanese use this unit a lot.

@tanger-code
Author

> I'd have written this as 50kbp and 100kbp, with k for kilo meaning a thousand from the SI units https://en.wikipedia.org/wiki/Metric_prefix
>
> Was the "w" for 10,000 from another language perhaps? I know Japanese use this unit a lot.

Haha, close guess. I'm Chinese.

@mdehoon
Contributor

mdehoon commented Feb 7, 2024

@tanger-code Were you able to use the PairwiseAligner for this problem? It will require a lot of memory (more than 50000 * 100000 bytes, i.e. roughly 5 GB).
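
To make the memory figure above concrete, here is a small back-of-the-envelope sketch. It assumes one byte per cell of the alignment matrix, which is the lower bound implied by the comment; the function name `min_matrix_bytes` is made up for this example, and a real aligner will need more than this.

```python
def min_matrix_bytes(len1, len2, bytes_per_cell=1):
    """Lower bound on memory for a full len1 x len2 alignment matrix."""
    return len1 * len2 * bytes_per_cell


n = min_matrix_bytes(50_000, 100_000)
# Report the estimate in bytes, decimal gigabytes, and binary gibibytes.
print(f"{n:,} bytes = {n / 1e9:.2f} GB = {n / 2**30:.2f} GiB")
```

Since the estimate scales with the product of the two lengths, halving either sequence halves the requirement.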
